Gene Franean1_2792 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2792 
Symbol 
ID5671181 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3300183 
End bp3301268 
Gene Length1086 bp 
Protein Length361 aa 
Translation table11 
GC content64% 
IMG OID641241701 
Producthypothetical protein 
Protein accessionYP_001507121 
Protein GI158314613 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTACGACG GGGGCGACTA CGAACTCGTC TGGCCACGGC ACCTGTTCGT CCACGAAGCG 
AGCAACCTGC TCAACCACCG CAGGATCCAC CGTGACTGGG ATGACCGCTG CCTACTACTT
CTCGATCACG CGTTCGCGGG CCCTACTCCC CAGGACGACT TCCGTCAGGC AGCCGCACAG
TCCCCGCCTC CACGTGGCCT GAGCAACGGG CAGGCATTCC TCCGCGACCT GATGTCCAGC
TCCGAACAGC TCCGCGAGAC CACAACACCG CAGTACCGCC CGTACTGGTC CGAACGCCGG
GCGGGTACCT CCCCTGACCG TGCCGGTCTG CGTGCCACAG CCCGCCAGTT CATCGATATC
GTCAGCCATC TCAACAACCA CGGATACTTC GAGCAGGCGT TCGGTAAGGA CTGCGTCGAC
GACCCCAGCG AGATCGACCC CTCGGCCGTC ATCGAGCGCG CCATCGGCGC CGCAGACCTG
TGGCCGCTGA CGCCGGACCG ACTCGCACAG AACATCGACG TGTTCTGCGA CGTGGTCGAA
GTGCTCCACG ATCTGGTAGC GCGCCCCCGA TCTCGCGGAC TACACGACTA CGACGGATGC
GGCTGGCACT ACCGCGATTT CTCTCCCGCC ACAGGCCGCG TCGTCTACCG GTGGCGCGTC
AATGGTCTGC TCGAACGAAG CGACCTCGGC CTCCACCTCG CGGACGAAGG CGAAGACGTC
GGTCGCCTGG TCACCAGCAC CGATCCCGCC CGATCGGACC TCCTGAGCCG CATGGCCCAG
CGAGAAAGCC CGGCCGCTGA CCGGCTCCGC CACGCCATCA GCCTGTACCG GGCACGGCAC
GCCGACGAAC ACACCAAGAG ATCCGCGGTC GTCGTCCTCA GTGGCGTTCT CGAAGAACGC
CGACAGCTGA TCAAAGATGA GCTGCTCAGC AAGGACGAAG GTGACCTCTT CACGATCGCG
AACAAGTTCG CGATCCGGCA CCAGAACGAA CAACAGAAAA CCGACTATAG CGCCGAGTTC
CTCGACTGGA TCTTCTGGTG GTACCTCGCG ACGATCGAGC TCACCGACCA TCTCCTCGCA
CGCTAA
 
Protein sequence
MYDGGDYELV WPRHLFVHEA SNLLNHRRIH RDWDDRCLLL LDHAFAGPTP QDDFRQAAAQ 
SPPPRGLSNG QAFLRDLMSS SEQLRETTTP QYRPYWSERR AGTSPDRAGL RATARQFIDI
VSHLNNHGYF EQAFGKDCVD DPSEIDPSAV IERAIGAADL WPLTPDRLAQ NIDVFCDVVE
VLHDLVARPR SRGLHDYDGC GWHYRDFSPA TGRVVYRWRV NGLLERSDLG LHLADEGEDV
GRLVTSTDPA RSDLLSRMAQ RESPAADRLR HAISLYRARH ADEHTKRSAV VVLSGVLEER
RQLIKDELLS KDEGDLFTIA NKFAIRHQNE QQKTDYSAEF LDWIFWWYLA TIELTDHLLA
R