Gene Franean1_2728 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2728 
Symbol 
ID5671119 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3225870 
End bp3229160 
Gene Length3291 bp 
Protein Length1096 aa 
Translation table11 
GC content67% 
IMG OID641241640 
Producthypothetical protein 
Protein accessionYP_001507060 
Protein GI158314552 
COG category[S] Function unknown 
COG ID[COG4995] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCGGTG GCGAATCGCC TACGGCCGTG CTGACTGATG AGTCCGCAAC CGAGGTACCC 
GGGCTGTTCG ACGAGTTGTA CGCGACGCCG AAGCGACTGG AGGACCTCGG CGCGCCCCTC
AGACGGTTCG CAGAGGCCAC GGGAACGGAC GGCGACGGGA TGCGCCAGGC TTTCGCCTTC
TTCCTGACGG CTCACGGCTA CGCCCCTCAT CTCGTCCCGG AGTGCCTCCG CGCATTTTTT
GCGGAGATAG CGGCCCATAC GGACGTGTCA GCCGGGCAGG GCGATCTGAT GCAGGCTGAG
GCGCTCCTCG CGCGCGCTGA GCGAGCCGAT GCTGGCGGGG TGATCGGCCT GTTGGACTCG
GCGGTCCGGA TGTTGCGCCG GGCCGTCGCC AACTTCGGCA TTGCTGACTC CGGTCGGTTC
AACGCGGTCG GCCTGCTCTG CCTTGCGCTT CGATGGCGCT ACCGCCTGAC GGGTCGGGGG
GAGGACCTGT CCGAAGGTAT CGACCTCCTG CGTGCGTCGA TCTCGGAGTG GCCGGACGAT
CCTCGCCGTG GGGCGCTCCA CGCCGAATTG AGCGTCGCGC TGCGGCTGCG CGCCGAACGG
ATCGGCGACC GAGCCGACCG GGACGAGGCC GTCGAGCAGG GCCGTCTGGC GGTCGGCAAG
GCGCAGGACC ATCAGTGGGT CGTCGCGGAC GCGCTCGCCG CGTTGGGAGC CGCGCTGCTG
ACACGGGCCG CGTGGATCCT CAGTCCTGAC GATGTAGAGG AAGCGGTAGC GGTCGCGCGG
CAGGGAATAG TAGTAGCAGC CCCCGACCTT AATCGACTGG GCTACGCGTT AGAACGGCTA
AGCGTGGCAC TACATACGAG ATTTGATACA TTCGGTGACG TCGCGGACCT GGTCGAGAGC
GTGGAGGTCG CCCGCCGCTG CGTCGAGGCG ACCGAAGACG CGTCGCTGGA CGCGGTGGGC
CGACGGTCAC AGCTGTGCTT CTGCCTCCTG GACCTCGCGC AGACGACCGG GTCAACAGAA
CTGCTCGACG AAGCGATCGC CCAGGGTCGC GCGGGCATAA ATGCGCTCAC GGCGTTCCAT
CCACGTGCCG GGCTACTGTG GGTTAATCTT AGCGACTGCT ACCGCACCCA CTATGAGGTC
ACGCGTTCGG TGACCGCCGT CGAGGCAGCT GTCCATGCAG CGCAGGAAGC GCTCGCGGTA
ACTTCGGCCG GCCATCCAGA CCATCCCAAA GCGCTGTTCG CGCTGGCCAG CGCGCTGTGG
TGGCAGTTCG AGCGAATGGG CGACACCGTA GATCTAGACG CGGCGCTCGA CTATGCCCGG
CAGGCACTGG ACGCGACGGC ATATGAGCGG CTCGACTATC CTAAGATTGC TGGGTTGCTC
GGACAGCTGG CTGCGGCCGC GGCAGGTCCC GCCAGCATCG ATGCGTTCGC CGAAGCAGTC
GCCGCGGGAA GATCGGCAGT CGCGGGCATC CCTGACGGAC ATCCTGACCT TCCGCTCTAC
CAGTCGAGCC TGGCCGCGGT TCTCCGCCGG CGAGCGGATG CGATCGGACA GCTGTCGGAT
CTCGACGAGT CGATCTCCCT GGACCGAGCC GCAGCGGATC AGGTTCCCCC TGGCCGCCCC
GATCGCGCAC GGCTCCTCTG CAATCTCGGA AACACCCTGC GCCTGCGTTT CCGAGTCACC
GGGGACAACG AAGATCTTCG GGATTCCCTG CACTTCTTCA GGACGGCCTT GAACACGGGG
ACGGCTCTGC CGTCCACCCG CCTGCAGGCA GCCAGCAGCC TCGGACGACT GGCCGGCGAG
ACGGGACAAT GGGCCATCGC CGCCGACGGC TACGCCACGG CCATTGGCCT GCTGCCGGTG
GTGTCCGCGC CAACCCTCCT TCGAACCGAC CAGGAATTCG TCCTCAGCCG GCTACCCGGT
CTCGCCGCCG ATGCGGCGGC GTGTTTCCTT CGCGCCGGCA TGCCGGAACA GGCTGTCCAG
CGTTGGGACG AGGGCCGCGG GGTCCTGCTA GCTCAGGCTC TCGGTATGCG AAGCGACCTC
TCCCGACTCG CGGACCATCA TCCCGAGGTC GGCCGACGCG TCGGCGCTGC CTGGGGCCGT
GCCCTTCGCA ACCAGGAACC GCGGCCATCG ACCGACGCGG CCAGTGACCT CAAAGAGCTG
ATCCAGGAGG TCCGCGGCCT GCCTGGATGG GAGGGCTTTC TCGCGCCGCC GACGCCGGTC
GACCTCGACA TGGCAGCCCG TGGTGGCCCA GTCGTCGTGG TCAACGTCTC CCAGTTCGGC
TCAGACGCAA TGATCATCAC CGCTGCCGGT ATCAACACCG TGCCGCTCCC AGGACTCACG
CCTATTGCTG TGCAAGACCA GGTCATCAGC TTCCTGCAAG CCTCGAATGG CGCCACCGCC
GATCCCGACT GCGAAATGCA ACAGACGCTC TCGTGGCTGT GGAACGCGAT CGCCAGTCCA
GTACTCGACA GGCTCGGTGT CAGCGCGGTG CCCGATGCCG GTGAACCGTG GCCTCGGCTG
TGGTGGTGCG TCTCCGGGCT GCTGTCGTTC CTGCCGCTGC ACGCAGCCGG TCACCACCAC
ACAAGGTTCG ATCCTGAGCC ACAGACCGTC ATCGACCGGG CCGTGTCCTC GTACGCACCG
ACCATCACCG CACTCATCGG CGCGCGAAAC AAACGCACCG CAGCGCCTAG CGTTGGCAAC
GCTCCGGACG TCCTCGTCGT CGCTCAACCA CAGACCCCCG GCCAATGCAG CCTGCCCGGC
GTACTCACGG AAGTCGCGGA CCTTAAAACC CGTCTGCCAG GCCGGATCAC CGAGCTGACC
GGAGCGCACG CTACTCGGGA AACTGTGCTC CGAGCCCTTA CTTCCTCTCA CTGGGCCCAC
TTCGCCTGCC ACGGGACCAG CGACCTCGCC AACCCGTCGT CCAGCAGCCT GCTCCTGCAC
GACTATCAGA CCTCACCACT AACAGTGATC GACCTCATGC AGCTCCAACT CGATCAAGCG
GAACTAGCAT TCCTCTCCGC CTGCGAGACA GCCCGCCCCG GCGCACAGCT CAGCGACGAA
GCCATTCACC TCACCGCCGC CTGCCAGCTC GCCGGCTTCC GCGACGTCAT CGGGACGCTC
TGGCCAATCA ACGACCTGGC CGCCGTCGAA TTCGCCGATG CCTTCTACAG CGCCCTCCTG
AACGAGTCCA CCGATGTCGC AGCCGCCGCT CATGCCGCAA CACGGCACAC CCGCGAGACA
TGGATCGACC AGCCCTCTCT CTGGGCTGCC CACGTGCACG TCGGCACCTG A
 
Protein sequence
MFGGESPTAV LTDESATEVP GLFDELYATP KRLEDLGAPL RRFAEATGTD GDGMRQAFAF 
FLTAHGYAPH LVPECLRAFF AEIAAHTDVS AGQGDLMQAE ALLARAERAD AGGVIGLLDS
AVRMLRRAVA NFGIADSGRF NAVGLLCLAL RWRYRLTGRG EDLSEGIDLL RASISEWPDD
PRRGALHAEL SVALRLRAER IGDRADRDEA VEQGRLAVGK AQDHQWVVAD ALAALGAALL
TRAAWILSPD DVEEAVAVAR QGIVVAAPDL NRLGYALERL SVALHTRFDT FGDVADLVES
VEVARRCVEA TEDASLDAVG RRSQLCFCLL DLAQTTGSTE LLDEAIAQGR AGINALTAFH
PRAGLLWVNL SDCYRTHYEV TRSVTAVEAA VHAAQEALAV TSAGHPDHPK ALFALASALW
WQFERMGDTV DLDAALDYAR QALDATAYER LDYPKIAGLL GQLAAAAAGP ASIDAFAEAV
AAGRSAVAGI PDGHPDLPLY QSSLAAVLRR RADAIGQLSD LDESISLDRA AADQVPPGRP
DRARLLCNLG NTLRLRFRVT GDNEDLRDSL HFFRTALNTG TALPSTRLQA ASSLGRLAGE
TGQWAIAADG YATAIGLLPV VSAPTLLRTD QEFVLSRLPG LAADAAACFL RAGMPEQAVQ
RWDEGRGVLL AQALGMRSDL SRLADHHPEV GRRVGAAWGR ALRNQEPRPS TDAASDLKEL
IQEVRGLPGW EGFLAPPTPV DLDMAARGGP VVVVNVSQFG SDAMIITAAG INTVPLPGLT
PIAVQDQVIS FLQASNGATA DPDCEMQQTL SWLWNAIASP VLDRLGVSAV PDAGEPWPRL
WWCVSGLLSF LPLHAAGHHH TRFDPEPQTV IDRAVSSYAP TITALIGARN KRTAAPSVGN
APDVLVVAQP QTPGQCSLPG VLTEVADLKT RLPGRITELT GAHATRETVL RALTSSHWAH
FACHGTSDLA NPSSSSLLLH DYQTSPLTVI DLMQLQLDQA ELAFLSACET ARPGAQLSDE
AIHLTAACQL AGFRDVIGTL WPINDLAAVE FADAFYSALL NESTDVAAAA HAATRHTRET
WIDQPSLWAA HVHVGT