Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4524 |
Symbol | |
ID | 5672873 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5396611 |
End bp | 5397735 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 641243389 |
Product | ABC-type nitrate/sulfonate/bicarbonate transport systems periplasmic components-like protein |
Protein accession | YP_001508805 |
Protein GI | 158316297 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCTA TGCGACGACT GTCCTTACTT CCCTTACGAA GGTCGTCTTC GTCGCCGAGG CCGCGCCGGA CCGTGTCCGC TGTCGCTGTC CTTGCGGTCG CCGGGAGTAT AGCCGCGGGC TGCGGGGGCT CCTCGGACAA CAGCGCGGGG TCGGACGGGG CGCTGAAGGT AAAGATCATG TCGACCTCGG CGACCTTTAC CGACCTCCCC ACCGTGGTCA TCGTTGCGCA GGACTACTTC AGGCAGGTCG GACTCGACGC CGATGTCAGT TTCTCGAACG CCAGCAATGC ATCTCTGATC ACCCAGGCCG TGATCTCCGG GGACACCAAC ATCGGTACGT CCGGTGCGGG CTCGCTGTAC AACGCCTATG CCGAGGGCAA GACCAACCTC GTCAGCCTCG GAACCACCAA CCCCAGTATC ACCTTCGGGC TGGCGCTGAA CCAGGAGACG CTGGACACTC TCGCCGAGCG CGGAGTAACA CCGGAGTCGT CCGCGGAAGA GCGCGTCCAG GCGCTGCGTG GCCTCACTCT TACCTCCTCG CCGGAGGGCT CCACCGGCAA CACCTATCTG CGCGTCATGC TCAGCGAGTA CGGAGTCGAT CCGGATCGCG ACGTGACGAT CCTTCCCAAC AACGACGCCT CCGCCCAGAT CGCGACCACG CGGCAGGGCC GGGTCAGCGG GTTCGCCCAG TCGTTCCCGC GGGTCAACTT CCCCGAGGCG GAGGGCTGGG GCGGGCTGTG GCTGAACTGG GCGGTCGACC TTCCGTCCAT CCTTCCGCTG GCCTCGCACG AGTACTACAC CACCCGCTCC TGGCTGGAGC AGAACCCCGA GATCGCCAAG CGCGTCATGC AGGCCGTGTG GCTCGCCGAC CGGGACCTGC ACAACCCGAC CGATGAGCTG CGGGACAAGG TGCGGGGATT GCCGCAGTTC GCCAACCTGA ACGAGACGGC CTTCAACGCG GGCTGGGAGG TCGCGGTCGG TGCCTACAAG GACGCGTCTC CCCTGACGAC CCAGGAGATG TTCGACAACC AGGTCCGGCT CGTGAACCTC AACCGTGACT CGCCGCTCAC CTTCGGCTTC GACGACATCT ACGACCTGAG CGCCGCGAAG GCCGCGCAGC CGTGA
|
Protein sequence | MPAMRRLSLL PLRRSSSSPR PRRTVSAVAV LAVAGSIAAG CGGSSDNSAG SDGALKVKIM STSATFTDLP TVVIVAQDYF RQVGLDADVS FSNASNASLI TQAVISGDTN IGTSGAGSLY NAYAEGKTNL VSLGTTNPSI TFGLALNQET LDTLAERGVT PESSAEERVQ ALRGLTLTSS PEGSTGNTYL RVMLSEYGVD PDRDVTILPN NDASAQIATT RQGRVSGFAQ SFPRVNFPEA EGWGGLWLNW AVDLPSILPL ASHEYYTTRS WLEQNPEIAK RVMQAVWLAD RDLHNPTDEL RDKVRGLPQF ANLNETAFNA GWEVAVGAYK DASPLTTQEM FDNQVRLVNL NRDSPLTFGF DDIYDLSAAK AAQP
|
| |