Gene Franean1_4524 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4524 
Symbol 
ID5672873 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5396611 
End bp5397735 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content66% 
IMG OID641243389 
ProductABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein 
Protein accessionYP_001508805 
Protein GI158316297 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCTA TGCGACGACT GTCCTTACTT CCCTTACGAA GGTCGTCTTC GTCGCCGAGG 
CCGCGCCGGA CCGTGTCCGC TGTCGCTGTC CTTGCGGTCG CCGGGAGTAT AGCCGCGGGC
TGCGGGGGCT CCTCGGACAA CAGCGCGGGG TCGGACGGGG CGCTGAAGGT AAAGATCATG
TCGACCTCGG CGACCTTTAC CGACCTCCCC ACCGTGGTCA TCGTTGCGCA GGACTACTTC
AGGCAGGTCG GACTCGACGC CGATGTCAGT TTCTCGAACG CCAGCAATGC ATCTCTGATC
ACCCAGGCCG TGATCTCCGG GGACACCAAC ATCGGTACGT CCGGTGCGGG CTCGCTGTAC
AACGCCTATG CCGAGGGCAA GACCAACCTC GTCAGCCTCG GAACCACCAA CCCCAGTATC
ACCTTCGGGC TGGCGCTGAA CCAGGAGACG CTGGACACTC TCGCCGAGCG CGGAGTAACA
CCGGAGTCGT CCGCGGAAGA GCGCGTCCAG GCGCTGCGTG GCCTCACTCT TACCTCCTCG
CCGGAGGGCT CCACCGGCAA CACCTATCTG CGCGTCATGC TCAGCGAGTA CGGAGTCGAT
CCGGATCGCG ACGTGACGAT CCTTCCCAAC AACGACGCCT CCGCCCAGAT CGCGACCACG
CGGCAGGGCC GGGTCAGCGG GTTCGCCCAG TCGTTCCCGC GGGTCAACTT CCCCGAGGCG
GAGGGCTGGG GCGGGCTGTG GCTGAACTGG GCGGTCGACC TTCCGTCCAT CCTTCCGCTG
GCCTCGCACG AGTACTACAC CACCCGCTCC TGGCTGGAGC AGAACCCCGA GATCGCCAAG
CGCGTCATGC AGGCCGTGTG GCTCGCCGAC CGGGACCTGC ACAACCCGAC CGATGAGCTG
CGGGACAAGG TGCGGGGATT GCCGCAGTTC GCCAACCTGA ACGAGACGGC CTTCAACGCG
GGCTGGGAGG TCGCGGTCGG TGCCTACAAG GACGCGTCTC CCCTGACGAC CCAGGAGATG
TTCGACAACC AGGTCCGGCT CGTGAACCTC AACCGTGACT CGCCGCTCAC CTTCGGCTTC
GACGACATCT ACGACCTGAG CGCCGCGAAG GCCGCGCAGC CGTGA
 
Protein sequence
MPAMRRLSLL PLRRSSSSPR PRRTVSAVAV LAVAGSIAAG CGGSSDNSAG SDGALKVKIM 
STSATFTDLP TVVIVAQDYF RQVGLDADVS FSNASNASLI TQAVISGDTN IGTSGAGSLY
NAYAEGKTNL VSLGTTNPSI TFGLALNQET LDTLAERGVT PESSAEERVQ ALRGLTLTSS
PEGSTGNTYL RVMLSEYGVD PDRDVTILPN NDASAQIATT RQGRVSGFAQ SFPRVNFPEA
EGWGGLWLNW AVDLPSILPL ASHEYYTTRS WLEQNPEIAK RVMQAVWLAD RDLHNPTDEL
RDKVRGLPQF ANLNETAFNA GWEVAVGAYK DASPLTTQEM FDNQVRLVNL NRDSPLTFGF
DDIYDLSAAK AAQP