Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_3571 |
Symbol | |
ID | 3904510 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 4268635 |
End bp | 4269708 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637880892 |
Product | ABC transporter, substrate-binding protein, aliphatic sulphonates |
Protein accession | YP_482652 |
Protein GI | 86742252 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components |
TIGRFAM ID | [TIGR01728] ABC transporter, substrate-binding protein, aliphatic sulfonates family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCCGA GCAGACTGTG GCGTGCAGCG ACGCCACTCA TCGCCGCGGG GGCGCTTGTA CTGGCGGTGG TGGGCTGCTC GAGTGACGAC GGGGACTCCC AGACGGTCGC CTCGAACGGC CAGATCACCG GCACCCTACG GCTCGGGTAC TTCCCGAACC TCACCCACGC GCCAGCTCTC TACGGCGCCG AGAAGGGGAT CTTCGCCAAG GACCTCGGCT CTGGAGTCAC GCTCAAGACA TCGACCTTCA ACTCCGGCGT GCAGGAGGCG GAGGCGATTC TGTCCGGCGC CATCGATGCC GGTTATATCG GTCCCAACCC GGCGGTCAAC AGCTTCATCA AGTCGCACGG GGAGGCGGTG CGGATCGTCT CCGGCGCTAC CTCGGGTGGT GCGTCGCTGG TGGTCAAGCC TGAGATCACC TCCGTCGCGC AGCTCAAGGG CACCACGCTC GCCACCCCGA GCCTGGGCAA CACCCAGGAC GTGGCGCTGC GGTACTTCCT CAAGAAGAAC GGGCTGAAGA CCGACACCCA GGGTGGGGGT GATGTCTCCA TCAAGCCCCA GGACAACACC GTGACGGTGG ACGCCTTCGC CAACAGGGCC ATCGACGGGG CCTGGGTGCC CGAGCCGACG GCCTCCCGGC TCGTCGCAGC GGGGGGCAAG GTGCTCGTCG ACGAGGCCGA CGAGTGGCCG GAGACCAAGG GCCAGTTCGT CACGACGGTG CTTCTGGTGA GCACGGACTA TCTGAAGAAG AACCCGGAGA TCGTCCGCCG GCTCATCACG GCGAATGTGG AATCGATCAA CGCGCTCAAC GCCGACCGTG ACGCCGGCGC GAAGGTCACC AACACGGCGC TGGGCAAGCT GAGCGGCAAG CCGCTGTCCG ACAAGATCCT CACCTTGGCC TGGAAGGGGC TGACCTTCAC TCCGGACCCG ATCGCGGCGT CCCTGTTCAC CTCGGCAAAG CACCAGGAGG AGCTCGGCCT CATCAAGAAC CCGAAGCTCG ACGGTCTGTT CGATCTGACG GTCCTCAACG AGATCCTCGC CAAGCAGGGC AAACCGACGG TCGCCGACTC CTGA
|
Protein sequence | MRPSRLWRAA TPLIAAGALV LAVVGCSSDD GDSQTVASNG QITGTLRLGY FPNLTHAPAL YGAEKGIFAK DLGSGVTLKT STFNSGVQEA EAILSGAIDA GYIGPNPAVN SFIKSHGEAV RIVSGATSGG ASLVVKPEIT SVAQLKGTTL ATPSLGNTQD VALRYFLKKN GLKTDTQGGG DVSIKPQDNT VTVDAFANRA IDGAWVPEPT ASRLVAAGGK VLVDEADEWP ETKGQFVTTV LLVSTDYLKK NPEIVRRLIT ANVESINALN ADRDAGAKVT NTALGKLSGK PLSDKILTLA WKGLTFTPDP IAASLFTSAK HQEELGLIKN PKLDGLFDLT VLNEILAKQG KPTVADS
|
| |