Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Glov_3677 |
Symbol | |
ID | 6369582 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacter lovleyi SZ |
Kingdom | Bacteria |
Replicon accession | NC_010815 |
Strand | - |
Start bp | 31795 |
End bp | 33105 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 642679091 |
Product | thiamine biosynthesis protein ThiC |
Protein accession | YP_001953896 |
Protein GI | 189426720 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0422] Thiamine biosynthesis protein ThiC |
TIGRFAM ID | [TIGR00190] thiamine biosynthesis protein ThiC |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.286824 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAACCC AGATCGAATA CGCCCGCGCA GGGGTGATCA CCCCCCAGAT GCAGCAGGTT GCTGCCACGG AAGGGCTGGC AGCTGAGCTG ATCCGCCAGC GGGTCGCGGC CGGCACCATC GTGATCCCCT GGAACCACAA CCGCAAACCG TCCCGGATCG CCGGGATCGG CCAGGGGCTC CGCACTAAGG TCAACGCCTC CATCGGCACC TCGTCCGATA TCATCGACTA TGCCGCCGAG GTCCGCAAGG CGCTGGCCGC CCAGGAGTCA GGGGCCGACA CCCTGATGGA GCTGTCCGTG GGGGGTGATC TGGACCGGGT GCGGCGGGAG GTGATCGCCG CCGTGGAGCT CCCGGTGGGC AATGTGCCGC TCTACCAGGC CTTTTGTGAT GCAGCCCGCA AATACGGTGA CCCCAACAAG CTGGACCCGG AGGAGCTGTT TGACCTGATC GAACAGCAGT GCGCGGATGG CATGGCCTTC ATGGCCGTGC ACTGCGGCAT CAACCGCTGC ACGGTGGAGC GGCTGCAGAA GCAGGGTTAC CGCTACGGCG GCCTGGTCAG CAAGGGCGGG GTCAGCATGG TGGCCTGGAT GCTGGCCAAC AACCGCGAAA ACCCGCTCTT TGAGCAGTTC GACCGGGTGG CAGCCATCCT CAAAAAATAC GATACGGTGC TCTCGCTGGG TAACGGCCTG CGGGCCGGCG CCATCCACGA CTCCTCCGAC CGGGCCCAGA TCCAGGAGCT GGTCTTTAAC TGCGAGCTGG CTGAACTTGG GCGGGAGATG GGCTGCCAGA TGCTGGTGGA AGGACCGGGC CATGTGCCGC TGGACGAGAT CGAAGGCAAC ATCAAACTGC AGAAGCGGAT GAGCGGCGAT GCCCCCTACT ATATGCTGGG GCCGATCCCC ACCGACGTGG CCCCCGGCTT CGACCATATC ACCTCTGCCA TCGGCGCGGC CCAGTCCGCC CGCTATGGCG CCGACCTGAT CTGCTACATC ACCCCGGCCG AGCACCTGGC CCTGCCCAAT GAGCAGGATG TGCGCGAAGG GGTCAAGGCC GCCAAAATTG CCGCCTATAT CGGCGATATG AATAAATACC CGGAGCGGAT GCGGGAGCGG GACAAGGCCA TGGCCAAGGC CCGGCGGGAC CTGGACTGGC AGAAGCAGTT CGAACTGGCC CTTTTCCCGG AGGATGCCAA GGCGATCCGC GCCAGCCGTA TCCCTGAGGA TGAGGCCACC TGCACCATGT GCGGCAACTT CTGCGCCTCC CGCGGCGCAG GCAAGCTGTT TGCGGAGCAT CTGTGCGGGG ACAAGTGCTG A
|
Protein sequence | MQTQIEYARA GVITPQMQQV AATEGLAAEL IRQRVAAGTI VIPWNHNRKP SRIAGIGQGL RTKVNASIGT SSDIIDYAAE VRKALAAQES GADTLMELSV GGDLDRVRRE VIAAVELPVG NVPLYQAFCD AARKYGDPNK LDPEELFDLI EQQCADGMAF MAVHCGINRC TVERLQKQGY RYGGLVSKGG VSMVAWMLAN NRENPLFEQF DRVAAILKKY DTVLSLGNGL RAGAIHDSSD RAQIQELVFN CELAELGREM GCQMLVEGPG HVPLDEIEGN IKLQKRMSGD APYYMLGPIP TDVAPGFDHI TSAIGAAQSA RYGADLICYI TPAEHLALPN EQDVREGVKA AKIAAYIGDM NKYPERMRER DKAMAKARRD LDWQKQFELA LFPEDAKAIR ASRIPEDEAT CTMCGNFCAS RGAGKLFAEH LCGDKC
|
| |