Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0251 |
Symbol | |
ID | 3903659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 289213 |
End bp | 290253 |
Gene Length | 1041 bp |
Protein Length | 346 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637877579 |
Product | putative sulfonate binding protein precursor |
Protein accession | YP_479368 |
Protein GI | 86738968 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCATC GACGACCGCG CGGGCGGCGT CTTTCCCTCC TGGCCGCCGG ACTTGCCGCA GCCGTCGCGG TGACCCTGGC CGCCTGTGGG AACGACGGCG GCAGCGGCAG CGCGGGTTCG GCCGCTACGG GTGGCCAGGT GACGCTGCGG ATCGGCGACC AGGGCAAGTA TTTGGAACTG CCGCTCACGC TCTCCGGCGA GGCGCGGGGC ACTTCCTACG GGTTGAGCTG GAACACCTTC GCCGACGGTC CGCACATGAA CGCCGCGTTC AGCGCGAGCA AGATCGACGT GGGCTTCATG GGCGATACAC CGGTACTGTT CGCGAACGGG GCGCACGCCG GCGTGACCGC GGTCGCTGTC GCGGAGAACC CGGTGAACAC CCAGACGATC ATCGTCAAGG CCGGCTCCGG GATTCACAAG CCAGCCGACC TCAAGGGCAG GCGCATCGCC CTCACCCTCG GTACCTCCCT GCACGGTTAT CTGCTCAACC AGCTCGCGTC CGCCGGTCTG AAGCAGAGCG ACATCACCCC GGTGAACGTG CCCATCACGA GCCTCGGAGC GACACTCGCC TCCGGTCGGG TCGACGCCAT CGTGTACGCC AAGCAGTACG TCGCCGCGGT CGGCCAGCAG GCTCCCGGTT CCTATGAGAT CGAGACCAAG CCGCTCCCGG TCTTCTCGGT CGTGCTGGCA TCGAAGAACA CCCTCAAGGA TCCGGCGAAG CGCACGGCGG TGCAGGACTT CCTGATCCGC CTGTCCCGGG CATCCGCCTG GCCGAAGGCC CATGAGGACG AGTGGATCAA GAAGTACTAC GTGGGCCAGC TCAAGCAGAA CCCGCAGACG GCGAGGAAGT ACTTCGACTC GCTGCCGCGG GCACTCTACA AGCCGGTCTC CGAAAGCTTC ATCGAGAGCC AGCGCGTGCA GGCCCGACTG CTGATCGACG TCGACCAACT GCCGAAGACC CTGAACGTCA ACGACGAGAT CGACAAGGGT TTCAATGCCG AGCTGACCGC CGCGTTCACC AAGGCCTCGC TGGCCACCTG A
|
Protein sequence | MQHRRPRGRR LSLLAAGLAA AVAVTLAACG NDGGSGSAGS AATGGQVTLR IGDQGKYLEL PLTLSGEARG TSYGLSWNTF ADGPHMNAAF SASKIDVGFM GDTPVLFANG AHAGVTAVAV AENPVNTQTI IVKAGSGIHK PADLKGRRIA LTLGTSLHGY LLNQLASAGL KQSDITPVNV PITSLGATLA SGRVDAIVYA KQYVAAVGQQ APGSYEIETK PLPVFSVVLA SKNTLKDPAK RTAVQDFLIR LSRASAWPKA HEDEWIKKYY VGQLKQNPQT ARKYFDSLPR ALYKPVSESF IESQRVQARL LIDVDQLPKT LNVNDEIDKG FNAELTAAFT KASLAT
|
| |