Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_3027 |
Symbol | |
ID | 4811099 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 3549650 |
End bp | 3551020 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640108448 |
Product | citrate synthase |
Protein accession | YP_001039416 |
Protein GI | 125975506 |
COG category | [C] Energy production and conversion |
COG ID | [COG0372] Citrate synthase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000187967 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGAATG CTAATTTTAT GGAGTTTGAA AAAAACAAAC TGGAAGAACT GACCGAATTG GCCACAAAAT GCAGTTATAT TGATCCGGAA CTTTTTGTCA AATACCAGGT AAAGCGCGGA CTGAGAGATT TGGACGGCAA GGGCGTGCTT GTGGGTTTGA CGGAGATTGG AGAAGTCCAC TCGTATATCA TTGATGAAAA TGAAATAGTT CCTGTTCCGG GTAGATTAAT TTACAGGGGA ATAGATATTT TTGATTTGGT TGACGGTTTT ATCAGTGAGG GACGTTTTGG TTTTGAAGAA ACTACATATT TGCTGCTTTT CGGTGATTTG CCGACCAGGG AACAGTTGAA TGAATTTGAA AAGCTTCTTT CTTCCTATCG CGAACTTCCG GAAGAGTTTC TCAATGATAT GATTTTGAAT CTTCCGGGCA AGGACATCAT GAATGTCCTG GCAAGAAGCG TGCTTGCCTA TTACACTTAT GACAGCAATC CGGATGATAC TTCGGTCAAA AATGTTTTAA GGCAGTGCAT GCGTTTGATT GGGAGCATGC CTACAATTGC CGTTTATGCT TACCAGGCAA TGGCTCATTA CTATGACAGA AAGAGTCTGG TTATTCATCC CCCAAGGCCC GAGCTAAGTA CGGCGGAGAA CATTTTGCAC ATGTTAAGGC CCGACAGCAA GTATACGGAC CTTGAAGCAA GACTGCTTGA CCTTGCGCTT GTATTGCATG CGGAACATGG AGGCGGTAAT AACTCAACCT TTGTAACCCA TGTGGTTACA TCTACCGGAA CGGACACATA CTCTGTTATT GCTGCGGCTC TCGGTTCGCT CAAAGGTCCT AGACACGGAG GAGCAAATAT CAAGGTTTGC CAAATGTTTG AGGACTTGAA GCAGAACGTC AGGGACTGGA AGGATGACGA AGAAATTGAA AATTATCTGA TGAAGCTGCT AAATAAAGAG GCTTTTGACC GTGCAGGACT GATTTACGGT ATAGGTCATG CAGTTTATTC CATTTCCGAC CCGAGATGTA TTCTTCTTAA GGAACAGGCG GAAAAACTTG CCAAAGAGAA AGGACTTGAA GATGAATTTG AACTGTATGA CAGGGTGGAA AGATTGGCTC CTCAAATTAT CGGAAGAGTT CGTAAAATGT ATAAGGGTGT TAGTGCCAAC GTGGATTTCT ACTCAGGTTT TGTTTATAAA ATGCTGAACC TTCCTCTGGA ATTGTATACT CCAATTTTCG CCATTTCAAG AATGTCGGGT TGGGCTGCTC ACCGTATAGA GGAAATTGTC AATGCCGGAA AAATCATCAG ACCGGCTTAC AAGAGCGTTG CCGAAAGACG CGATTATGTT GCTATGAAAG ACCGAAAATA A
|
Protein sequence | MMNANFMEFE KNKLEELTEL ATKCSYIDPE LFVKYQVKRG LRDLDGKGVL VGLTEIGEVH SYIIDENEIV PVPGRLIYRG IDIFDLVDGF ISEGRFGFEE TTYLLLFGDL PTREQLNEFE KLLSSYRELP EEFLNDMILN LPGKDIMNVL ARSVLAYYTY DSNPDDTSVK NVLRQCMRLI GSMPTIAVYA YQAMAHYYDR KSLVIHPPRP ELSTAENILH MLRPDSKYTD LEARLLDLAL VLHAEHGGGN NSTFVTHVVT STGTDTYSVI AAALGSLKGP RHGGANIKVC QMFEDLKQNV RDWKDDEEIE NYLMKLLNKE AFDRAGLIYG IGHAVYSISD PRCILLKEQA EKLAKEKGLE DEFELYDRVE RLAPQIIGRV RKMYKGVSAN VDFYSGFVYK MLNLPLELYT PIFAISRMSG WAAHRIEEIV NAGKIIRPAY KSVAERRDYV AMKDRK
|
| |