Gene Franean1_7001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7001 
Symbol 
ID5675312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8531462 
End bp8534125 
Gene Length2664 bp 
Protein Length887 aa 
Translation table11 
GC content73% 
IMG OID641245847 
Producthypothetical protein 
Protein accessionYP_001511238 
Protein GI158318730 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.890156 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGTTGA AGCAGCTGTA CTACACGTCG TGCGAGGTCG GCCTTTCGGG ATACCCCGGA 
TTCCAGTTCA ATGCGGCGAG CCGCGGGGTC GCTCCGGAGG TGATGCGGCG GGTCGAGTCC
TTCACCGCCT ACGACCCGCC GGGCTCGCTG CCCTACAACG CGGATGCCGC GGCGCTCGCG
GCCTGCCCGG TGAATCTCTG CTTCGAGCCG GCGGAGGACA CCGGCGTGCT GGCGAACGTC
GTCTTCGTGG GCACGGACTA CTCGAACCGG TTCGGGAACT ACTTCGCGCA CGCGTTGGTC
AGCGAGGACG TCGCCACCGA TCTCGGGGAG GCGTTGCCCA TCGAGCTGTG GGGCGCCCCG
ATCTGGACGG CCAGCCCGAA GACCGACGGC GCGGAACTGC CGGAGCTGGC CGGGCCGCCG
CCACGGGGAA GGCTCGACCG CCGGCACGTC GCCGCCTTCC TGCGTACGCA TCCCGGCCGC
GCCGCACTGC CCGCCCTGGT GTCCGCGGCG GCCAGGGTGA TCACCGGCGA ACGGCGAACG
ATCGTGCTGA TCGAGCGGGA CAGCGGGGCC GCCGCGAACT GGATCGGCGC CGTCTCCTAC
CTGCTGCCGC CGGCGCTGGC CCGCCGGATG TCCTTCGCCA CCTACCACCA CCGGCCCGCC
TACAGCGACC TGCACCTGGT CGCGACAGTG CCCGACGCCG ACGTCGACCG CGGTGACGGC
GCGTTCGCGA GCTTCCTCCA GTTCGACATT CCGGCCAACC GGAGCAGCCA GGTGCCGGTC
GAGCCGTGGG CCGCCCTGCT GGCCGAGACC GGCGTGGAGA ACGCGGCCGA GCTGTGGGCC
GCCGCGGCCG AGCTCGGCGG CGGCCTGCCC GTCCGCTCGG GAGAGTGGCA TCCGCTGCTG
GCCGCGGCGC TGTTCCGGCA CGGTCACCAG GCCTCCGGCG CCGGGCCGGC GGTGACGGCC
GCCGCCGCGA CCGCCGCGCG CTGGCTGGGG CGCCAGGCAC CCTCGCTGGA CGCGCGAACT
ATCGAACTGA TCGGCGTCGG GGCCCTGGGG GTCCTCGCCG GGAGCCGGCC GCCCGGCACC
GGCGCCGCAC TCACCGCCCT CGCGGCGGCC GCGGCGGAGG CCGGCGCCGC GCCGCTGGCC
ACCGAGATCG AGAAGCTGGC GGTGGACATC GAGGTGGACG GGCTGCTGGC GCACGAGCCC
GTCAAGGGTG CCGGTGGCGC CGAAGCGGTG CCGCTCAGGT CGCCTGCCGG TCGCGCCCAC
GCCACCGACC GCTGCACGCG AGCCCTGCGC AGATGCACCG CCGAGGTGGC GGTCGATCTG
CTCCTCCTGG CTGCCCGGGA GCGGCTCGAC CTGGACAGCG CCCTGCTGCG GACGGCCGGA
GAACGCGTCG TCGGCCCCCG GCTGCTGGCC GAGCCGACCG AGGCGACGGC ACGGCTGGCA
CGTGGCTGGC CCCAGCTGCG GGCGGGGGTG GTGCATCACC TCGCCGCGGT CGCGGCACGC
GAGCCGGACC GTCTCGTCGA GGCGCTCACG CACGGCCTCG GTGAGGCGCT GGACGACGAC
GACACGCGTG CTGCCCCGGC TCTGCACGAA GCCCGGCTGA TCGCCGCGTC CCGCCGCGAC
CCGCGGCTGC GGATCGACAT GCTGGCGGAG ATCGCGATGC TGCGGGCAAG CGGTCCGGAA
TCGACGGGCA CACCGGCCGT GGACGCGGAC CTGTTGCGGC GCGTCTGGCC GCACGGAAGC
TGGACGCCGG GCGAGGCACT GCGCGCACTT GAGGTGCTCG ACGAAGCCAG CCTGAATGCT
CCCGCCACCG TGGCCTGGCT GAACGCCGTC CTCACCCGCG AGTGGCCGGC GACGGCCGCA
GCGGACGGCG ACCATCTCGA GGACCTCGAA GCCCTCTGCC ACGCGTTCGA GGGGAGGCCA
GCCGCGTCCC GGCTCAGTGA CCCCGCACGC GCGACGATCA CCCGGATGAC CACGGTCAGA
CGCAGGCTGG CCGCTGCCAC CGGCCCTGGC CGCCAGGCCG CCGTCCTGAC CGCCGGGCTC
GACCGGATGC CCGAGACGCA GCGGGCACAC CACCTCAGCA AACTGACCGA CCTGCTGAAC
AAGGCACCGG CCGGGGACCT CCCCGAGATC TTCGCGTCGT GCCCCGCGGA GGCCGTGGAG
CACTACCTGC TCGACGTCGA CACCTATCTC AGGGCCGACC GTGCCCAGAC GGACATCGCC
GGCCGTCTCT TCTACGCGGT GGTCATCCTG CGGCGCAATC CGTCCCGCAA GGTCCGCGTC
GTCGCCACGG TCATCGAGGA CCGCCTGCTG GACAGGCTGA TCACATGGCA CTGGCAGGAC
CTGCGCGCCC TGCATGTCTG GATCAGGAAG CTCGATCCCG CGGTGGCCAC GGATTTCGCC
AACTGGCGGG ACCTCAACGC CGCCGGATGG CTGCGCCGGA CCCTGACGCG GCGTCGGCTG
GAGAAGGAGG CCGACGCGCC ACGGAGGGCG GCGGAGAAGA AGGCCCGGGC CGAGCGGAAG
GCCGCGGCGG AGAAGAAGGC CGCCGCCGCA CAGAAGGCCG CCGACAAGAA ACGCTCGGGA
AGCGGAGGGA AGAACGGTAA GAACGGGAAA TCTGGGACGG AAGCCCAAGG GAAGGCTGCG
GCGAAAAAAC GGAACCCTCG TTAG
 
Protein sequence
MVLKQLYYTS CEVGLSGYPG FQFNAASRGV APEVMRRVES FTAYDPPGSL PYNADAAALA 
ACPVNLCFEP AEDTGVLANV VFVGTDYSNR FGNYFAHALV SEDVATDLGE ALPIELWGAP
IWTASPKTDG AELPELAGPP PRGRLDRRHV AAFLRTHPGR AALPALVSAA ARVITGERRT
IVLIERDSGA AANWIGAVSY LLPPALARRM SFATYHHRPA YSDLHLVATV PDADVDRGDG
AFASFLQFDI PANRSSQVPV EPWAALLAET GVENAAELWA AAAELGGGLP VRSGEWHPLL
AAALFRHGHQ ASGAGPAVTA AAATAARWLG RQAPSLDART IELIGVGALG VLAGSRPPGT
GAALTALAAA AAEAGAAPLA TEIEKLAVDI EVDGLLAHEP VKGAGGAEAV PLRSPAGRAH
ATDRCTRALR RCTAEVAVDL LLLAARERLD LDSALLRTAG ERVVGPRLLA EPTEATARLA
RGWPQLRAGV VHHLAAVAAR EPDRLVEALT HGLGEALDDD DTRAAPALHE ARLIAASRRD
PRLRIDMLAE IAMLRASGPE STGTPAVDAD LLRRVWPHGS WTPGEALRAL EVLDEASLNA
PATVAWLNAV LTREWPATAA ADGDHLEDLE ALCHAFEGRP AASRLSDPAR ATITRMTTVR
RRLAAATGPG RQAAVLTAGL DRMPETQRAH HLSKLTDLLN KAPAGDLPEI FASCPAEAVE
HYLLDVDTYL RADRAQTDIA GRLFYAVVIL RRNPSRKVRV VATVIEDRLL DRLITWHWQD
LRALHVWIRK LDPAVATDFA NWRDLNAAGW LRRTLTRRRL EKEADAPRRA AEKKARAERK
AAAEKKAAAA QKAADKKRSG SGGKNGKNGK SGTEAQGKAA AKKRNPR