Gene Franean1_0801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0801 
Symbol 
ID5669217 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp934204 
End bp936909 
Gene Length2706 bp 
Protein Length901 aa 
Translation table11 
GC content75% 
IMG OID641239729 
Producthypothetical protein 
Protein accessionYP_001505165 
Protein GI158312657 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0459133 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGCCTG ACGAGTGGCA TGGCCACGGC CTCGGCCGCC CCGATCTGAC CCGTGAGAGC 
CGGGCCGACG TGCCGACCGG TGAATGGGTC GTCGACGACG GCCCCGCGGG CACCCGGCCG
GCGTCCGACT GGTCGATCGG GGTCTATCGG CGCGGTGACG AGGTTCCGGG GCCCGGCCCG
GCTACCGGAG CGACGGCCGC CACCGGCGCC ACCGGCGCCA CCTCCGCCAC CTCCGCCTCG
CCGCCCGCTG GAGCGGCCGC GGTTCCCGCC CCTCCCGCCC AGCCGGCCAC CGGCAGGGCT
CCCGGCTCAC CTTTTGACCC CGAGATGCTC TCCTCCCGGC GCCGGCCGGC GCGCCGCCGT
TCGCCGGCCG CGCCGATCGC CCCCTACCCG GAGGGCCCGG CCTACCCGGG ATCCGCCACG
CGCGGCGAGT CCGGGGCGCG CGGCGAGTCC GGGGCGGAAT GGCCCGATGA CCACGAGTCC
GGCCAACGCG GACGGGGCCA CAGCGGGTCC CGGCGCGATC GCGACTCCAG ATGGGACGTG
CCCGGATGGG ACGCCCCCGG ATCCGACCGT CCTGGCCCCG GCGAGGGCGA ACAGCGCCAG
CGTCAGCCTG AACGGCACCG GTCCGGACCC GGCCGGCCCG CGGGTCGCGG CGCCGGTGAG
CCCACGTCGG GGTACGGCGA ACCCGGCCAC GGCCACGGGC GCGATCCGTA CCGTGGCGCG
GAGGGGCGTC CCGGCGCGGC GGGGCACTGG GATGAGCCGG ACTGGGCCGG CGAGCCCTCC
CGTCCCGCCC CGTGGACCCC ACCGCCCTTC GAGGCACCGT CATCCTCGGC ACCGCCCGCA
CGGGTCCCGC CGCCGGCGGC ACCGCCGTTC TCGGCACCGT CGTTCTCGGC ACCGAGCTCG
CGGGTGCCGC CGCCGCCGTC GGAGTCACCG TCCTCGACGT GGCCGCCCGG TGATCTCGCC
ACCCAGCGTT TCCAGTGGCG AGACTGGGAA GGTGAGCCGC GGCCAGGAGG ACCGAGGCCC
ACCCGGCCGG GCGGGGAACG CGCAGACGAG ACGATGATCC TCCCGCCGTC CCGGCCGGAG
CAGGGCGCGC GCGAACGCGG GGACCGCGAC CGGCAGAGCC GACGGTCGGG CCGTGACCAG
CGGAACCATC AGCCGGGCCG TGCCCCCCGC GACCAGCCGG ACCCTGATCG GCGCGGCGAC
CAGCCGGGCC TTGACCGACG TGACCGGCCC GGCCGGGATC GGCGGGGCGA TCAGCCGGAG
CGGTCGCGGC GGGGGCAGGA GCCGGCCGGG CGGGCGCAGG AACATCCGGA GCGGCCGCGG
CAGCGGCGTT CGGCCACCGA CCGCGCCCGG TCCGATCGCG CCCGGTCCGA TCGCGCCCGG
CCTGACCGCG CCCGGTCCGA GCGCTCGCGG CCCGGGTCCG CCGGCGGGCA CGTCACCGGG
GTGCTGCCCA TTGCCGACCC CGTCCGGGAC GGGCGTGGTT CCGCGCAGCC GCCGCGGACC
GAGCCGGCGG CCGACGCATC GTTGCCACAC CGGGTCACCG ACTGGATCAT CGAGCACTGG
GGCGCGAAGC CTGGGCGGCA CCCGTACCTG CCAGTGGGCC TGGTCGCCCT GGCGGCGGTG
CTGGCCCTGG TCGCCTGGAT GCTCGGGCCG GCCCAGAACA GCCCGGCGAG CGTGGACGCG
GCCCCGGTCA CGCCGACTCC GTCCGCCGCA CCCTCGCCTG CCGCCGTGCC GCCGCCGGCA
CCCGCCTCGG TCGAGCCCGC CCCCGGATCC GGCGCGGTCG CCCCGCCATC GGGGCAGGTG
ACCCGCGTGG CGCGGGCGGC GACCGCGGGG AGTTTCCCCA GCGGGGTGGC CGCGCACACG
GTGGCGGAGG CGACGGCGTG GGCGCAGTTC CGCGGCCGGC CGGTGGACGT GGTGGTCACC
TACACGGACC GGAACAGCTG GGACGCGATC GTGAACCCCT GGATCGGCCG CAGCGCGTCG
ACGTTCTCAA ACTTCGCCGG CACCCTGGTC ATCAGCGTTC CCCTTTTTCC GGATGAGGGC
CCGGAGCTGG GAAATCTCAC CGACTGTGCC GCTGGTGACT ACGACGCGAA ATGGCGCCAG
TTCGGCCGGT GGCTGGTCAG CGAGGGCCGT GGGGACTCGT TCGTCCGCCT CGGCTGGGAG
TTCAACGGCG ACTGGTTCGC CTGGCGGGCC TCAGCGAGCC CGACGTCCTA CGTGCAGTGC
TTCCGCAACG CCTCGGCGTC GATCAAGGAG ACGAGCCCGA AGGTCCGCAT CGACTGGAAC
ATCAACGCCC ACGGGCCGCG CAGCGCCTTC GCCGTCTACC CGGGCGACCA GTACGTCGAC
GTCATCGGCA TCGACAGCTA CGACCAGTAC CCGCCGAGCC CGACCCTCAG CGCCTTCGAC
GCCCAGTGCG ACGCCACCGA AGGCCTGTGC CAGGTGATCA GTTTCGCCCG CCGGCACAAC
AAGCTGTTCT CGGTGCCCGA GTGGGGTGTG GTCAGCCAGC AGAACACCAA GGCCGGCGCC
GTCGGCCAGG CGGGCGGGGA CAACCCGGTC TACATCGAGC GGATGTACAG CATCTTCGAG
CGCAACGCGG ACATCCTCGC CTACGAGGCG TACTTCAGCG ACGACGTCCC GGGCAACGTC
CACTCGTCCC TGCTCAGCCC CAACCGCCAC CCACGCTCGG CGGACACCTA CAAACGACTC
TGGTAG
 
Protein sequence
MPPDEWHGHG LGRPDLTRES RADVPTGEWV VDDGPAGTRP ASDWSIGVYR RGDEVPGPGP 
ATGATAATGA TGATSATSAS PPAGAAAVPA PPAQPATGRA PGSPFDPEML SSRRRPARRR
SPAAPIAPYP EGPAYPGSAT RGESGARGES GAEWPDDHES GQRGRGHSGS RRDRDSRWDV
PGWDAPGSDR PGPGEGEQRQ RQPERHRSGP GRPAGRGAGE PTSGYGEPGH GHGRDPYRGA
EGRPGAAGHW DEPDWAGEPS RPAPWTPPPF EAPSSSAPPA RVPPPAAPPF SAPSFSAPSS
RVPPPPSESP SSTWPPGDLA TQRFQWRDWE GEPRPGGPRP TRPGGERADE TMILPPSRPE
QGARERGDRD RQSRRSGRDQ RNHQPGRAPR DQPDPDRRGD QPGLDRRDRP GRDRRGDQPE
RSRRGQEPAG RAQEHPERPR QRRSATDRAR SDRARSDRAR PDRARSERSR PGSAGGHVTG
VLPIADPVRD GRGSAQPPRT EPAADASLPH RVTDWIIEHW GAKPGRHPYL PVGLVALAAV
LALVAWMLGP AQNSPASVDA APVTPTPSAA PSPAAVPPPA PASVEPAPGS GAVAPPSGQV
TRVARAATAG SFPSGVAAHT VAEATAWAQF RGRPVDVVVT YTDRNSWDAI VNPWIGRSAS
TFSNFAGTLV ISVPLFPDEG PELGNLTDCA AGDYDAKWRQ FGRWLVSEGR GDSFVRLGWE
FNGDWFAWRA SASPTSYVQC FRNASASIKE TSPKVRIDWN INAHGPRSAF AVYPGDQYVD
VIGIDSYDQY PPSPTLSAFD AQCDATEGLC QVISFARRHN KLFSVPEWGV VSQQNTKAGA
VGQAGGDNPV YIERMYSIFE RNADILAYEA YFSDDVPGNV HSSLLSPNRH PRSADTYKRL
W