Gene Franean1_2840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2840 
Symbol 
ID5671229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3356044 
End bp3357555 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content70% 
IMG OID641241749 
Producthypothetical protein 
Protein accessionYP_001507169 
Protein GI158314661 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.43612 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTCGAC ACAGTCTGCG GGTGGAGCAG GATGAGCTGC GGGCCCGGAT GCGCACGGTC 
GGCATGTCCC ACGACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCACTA CCGCCCCCGC
GCCGCCCATC GCCATGCCCG CGGCTGGACC CAGACGCAGG CCGCGAACCA CATCAACGCC
CACGCCGCCC GCGCCGGCCT CGACCCGGAC GGCGCTGCCC CCATGACCGG CCCGAAGCTG
TCGGAACTGG AGACCTGGCC GCTGCCGAGC AACCGCCGCC GGCCCACCCC CCAGATTCTC
GCCCTGCTCG CCGAGGTCTA CGACACCAGC ATCCACAACC TCATCGACCT GGACGACCGC
GAACACCTTC CCCCCACCGA CATGCTCCTC ATCAACACCG CGCGGAGGAA CCCTGCGGCG
GACCCGGAAC GTGGATCAGC ATCGGCCCCT GTGCTCGCGG ACGACTCAGG CCAGCCTCGC
AGAGGCGAAC AGATCACGAT GCCAGTCGTA CCAACAGCGC AGGTTATGAG GCCCGCAGAT
GCTACGGGAG CCGCGATGCC CGACCTCGAC CGACGAGACT TCGTCACCGC GACGGCCACT
GCGCTCACGT TCGGCAGGCC CGCCCTCCGG AACGTAGACC CGGCCCTCAT CGACTACTTC
AACCAGCAGC TGGAAGGCCA CTACCAAGCG GACATGATGC TCGGCCCCCG CGAGCTAATC
AGCACCGTCA CCGCCCAGCA CACTCTCATC AGCAATCTGG TGGCGACCGG CCACGGGGGC
ACGCGGCGGG CCCTTCTCGG CGCAAGCGCG GCCTACGCCT CACTGATCGG CTGGCTCCAT
CAGGACGCCG GTGACCTTCC GAGTTCGTCC GTCTGGCGTG GCATCGCGCT GGAAGCCGCA
CAACGCTCCC GGGACCACCA GCTCGTCGCC TACGCGCTGC TCAACCACGC ATCCGTTCGC
ACAGATCTTG CCGACGGTCT CGGTGCGCTC GACCTGTGTG GCGCGGTCCT GGCAGACGCC
GGCCGGCTCA GCCCGAAGAT GCGGGTCCTA GCGCTCCAGC AACAGGCCCA CGGCGCGTCG
CTCATCGGAG ATCGTGCGAC CGTCGACTCC GTCCTGGACC AAGCGGCCCC GCTCGTCGAG
AGATGCGACG ACGCCATGCC GTGGGGCAAC GCCTGCCGGC GCACGCCGGC CTACCTGGAG
GTGCAACGCG CCACCTGCTA TGGACGCTTG GGACTGGCCA CCGCCGCCAC CGGCCTGTGG
CAGCAGGTTC TCGCAACGAC GCCTACCCAT GCCCGGCGGG ACCGCGGCGT GTATCTCGCC
CGTCAGGCCA CCGCATGTGT CCAGAACGGA GATCTGGGGC GCGCGGTCGA GGCCGGACGG
CTCGCGGCGG ACGTGGCGGT GGAAACTGGG TCTGTGCGGA TTCGCCGCGA GCTCGCCGGC
CTACGGCAAG CCGCAGAACC GTGGAAGGGC ACGGCCGTCG GCCGCGACCT CGACGAGATC
TTCGCGCTCT GA
 
Protein sequence
MSRHSLRVEQ DELRARMRTV GMSHDEIAIE FARRYHYRPR AAHRHARGWT QTQAANHINA 
HAARAGLDPD GAAPMTGPKL SELETWPLPS NRRRPTPQIL ALLAEVYDTS IHNLIDLDDR
EHLPPTDMLL INTARRNPAA DPERGSASAP VLADDSGQPR RGEQITMPVV PTAQVMRPAD
ATGAAMPDLD RRDFVTATAT ALTFGRPALR NVDPALIDYF NQQLEGHYQA DMMLGPRELI
STVTAQHTLI SNLVATGHGG TRRALLGASA AYASLIGWLH QDAGDLPSSS VWRGIALEAA
QRSRDHQLVA YALLNHASVR TDLADGLGAL DLCGAVLADA GRLSPKMRVL ALQQQAHGAS
LIGDRATVDS VLDQAAPLVE RCDDAMPWGN ACRRTPAYLE VQRATCYGRL GLATAATGLW
QQVLATTPTH ARRDRGVYLA RQATACVQNG DLGRAVEAGR LAADVAVETG SVRIRRELAG
LRQAAEPWKG TAVGRDLDEI FAL