Gene Franean1_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2638 
Symbol 
ID5671032 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3116319 
End bp3117899 
Gene Length1581 bp 
Protein Length526 aa 
Translation table11 
GC content70% 
IMG OID641241554 
Producthypothetical protein 
Protein accessionYP_001506974 
Protein GI158314466 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.316802 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCACAC CCAGCCACCG CGTCGAGCAG GAGGAGCTGC GGGCGCGGAT GCGCGCGGTC 
GGTATGTCCC ACGACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCACTA CCGTCTCCGT
GCCGCTCACC GGGTCGCGCA CGGCTGGACC CAGCAGCAGG CCGCAAACCA CATCAACGCC
CACGCCGCCC GCACCGGCCT CGACCCCCAG GGCACTGCCC CCATGACTGC CCCCCGGCTG
TCGGAGCTGG AGAACTGGCC GCTACCGAAC AACCGCCGCC GGCCCACCCC CCAGCTCCTC
GCCCAACTCG CCGAGGTCTA CGACACCAGC ATCCACAACC TCATCGACCT CGACGACCGC
GAACACCTCA CCCCCGCCGA CACACTCCTC ATAGCCCAGA TCCGCAAGGG CCTCCCACAG
CCAACGCAGG ACGGTTCCGC ACCCGACGCC ACCGCGATCA GCAGCCGGGG ACCGCAGATC
GATCTGCCTC GCATCGGACG CTCGCGTGTC CCTGCCGTTC CGATCGGTGT ATGCGGCGCC
GTCCCGACCG AGATCGACCC CGGTGCGGAC ATCGACGCGG ACACGGCACT GCGCCGCGCG
CACGAATGGC TGGTGACCGA ACCACCGCAG GCCGTGGAGA CCCGCACGGG ACGGCGGATC
GGGGAGGCGT TCACCCGCAA GGTCGAGGGC CGCGTCGCAC AACTGCGCCG CCTGGACGAC
TTCGTTGGCG GCCGGGACCT GTATGAGCTG GTCGCGCGGG AAGTCGCCGC CACGACCGCC
GTGCTCGACG ACGCCGCCTA CGACGAGCAT CTCGGACGTC GACTGCGGTC CGCTGTCGCG
GAACTGTGCC AGCTCGCCGG CTGGGTCGCG ATGGACGCCG GCCACACCCA GGCCGCGCGG
CGCTTCTACC TCGACGGGGT GAAAGCCGCG CACGCCGCCG GCAACAGCCC GGTCGCGGCG
AACCTGATCT CGACGTTGAG CTACCAGTTC GCCAACCAGC ACGACCCGCG CACCGCCATC
CTGCTGGCCC GCACCGCCCT GCGCGGAGCG GAGAACTCCG CGACGCCGGC CACCCTGGCA
CTGTTGTACG AACGCATCGC CTGGGCACAC GCGAAAGCCG GCGACCGGTC CGCCACGGAG
AAGGCACTCG CCGCCGTGGA GCGTCACTAC GACCAGCGAC GTCCCGACGA CGAACCGACC
TGGGTGTACT GGCTCGACGA CAACGAGATC CAGGTGATGG CCGGCCGCTG CTACGTCGAA
CTCGGCCTCC CGCAGCACGC CGAGCCGCTG CTGGTCGACG CGGTCGCCCG CTGCGACGAA
GACCACGCCC GCGAAGCCGC CCTTTACCGC TCCTGGCTCG CCGAGGCGTA CCTGCAGACA
GGCGACATCG GCCGGGCCGT CGAAGAAGCC ACGCATGTCG TCCGGCTCGA CGCCCGCGCC
GGATCAGCAC GCACCTCCGA CCGGGTCCAA CACCTGCGAG CCGGCCTCGC CGCGTTTCGC
ACCGACCCGG CAGTCCGCGC CTTCGAGGAC CTCTACCAAT CCGAAGCGGA TCTTCCAAGC
AACCTGCGTC GACCGAACTG A
 
Protein sequence
MATPSHRVEQ EELRARMRAV GMSHDEIAIE FARRYHYRLR AAHRVAHGWT QQQAANHINA 
HAARTGLDPQ GTAPMTAPRL SELENWPLPN NRRRPTPQLL AQLAEVYDTS IHNLIDLDDR
EHLTPADTLL IAQIRKGLPQ PTQDGSAPDA TAISSRGPQI DLPRIGRSRV PAVPIGVCGA
VPTEIDPGAD IDADTALRRA HEWLVTEPPQ AVETRTGRRI GEAFTRKVEG RVAQLRRLDD
FVGGRDLYEL VAREVAATTA VLDDAAYDEH LGRRLRSAVA ELCQLAGWVA MDAGHTQAAR
RFYLDGVKAA HAAGNSPVAA NLISTLSYQF ANQHDPRTAI LLARTALRGA ENSATPATLA
LLYERIAWAH AKAGDRSATE KALAAVERHY DQRRPDDEPT WVYWLDDNEI QVMAGRCYVE
LGLPQHAEPL LVDAVARCDE DHAREAALYR SWLAEAYLQT GDIGRAVEEA THVVRLDARA
GSARTSDRVQ HLRAGLAAFR TDPAVRAFED LYQSEADLPS NLRRPN