Gene Glov_3677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGlov_3677 
Symbol 
ID6369582 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter lovleyi SZ 
KingdomBacteria 
Replicon accessionNC_010815 
Strand
Start bp31795 
End bp33105 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content65% 
IMG OID642679091 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_001953896 
Protein GI189426720 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value0.286824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACCC AGATCGAATA CGCCCGCGCA GGGGTGATCA CCCCCCAGAT GCAGCAGGTT 
GCTGCCACGG AAGGGCTGGC AGCTGAGCTG ATCCGCCAGC GGGTCGCGGC CGGCACCATC
GTGATCCCCT GGAACCACAA CCGCAAACCG TCCCGGATCG CCGGGATCGG CCAGGGGCTC
CGCACTAAGG TCAACGCCTC CATCGGCACC TCGTCCGATA TCATCGACTA TGCCGCCGAG
GTCCGCAAGG CGCTGGCCGC CCAGGAGTCA GGGGCCGACA CCCTGATGGA GCTGTCCGTG
GGGGGTGATC TGGACCGGGT GCGGCGGGAG GTGATCGCCG CCGTGGAGCT CCCGGTGGGC
AATGTGCCGC TCTACCAGGC CTTTTGTGAT GCAGCCCGCA AATACGGTGA CCCCAACAAG
CTGGACCCGG AGGAGCTGTT TGACCTGATC GAACAGCAGT GCGCGGATGG CATGGCCTTC
ATGGCCGTGC ACTGCGGCAT CAACCGCTGC ACGGTGGAGC GGCTGCAGAA GCAGGGTTAC
CGCTACGGCG GCCTGGTCAG CAAGGGCGGG GTCAGCATGG TGGCCTGGAT GCTGGCCAAC
AACCGCGAAA ACCCGCTCTT TGAGCAGTTC GACCGGGTGG CAGCCATCCT CAAAAAATAC
GATACGGTGC TCTCGCTGGG TAACGGCCTG CGGGCCGGCG CCATCCACGA CTCCTCCGAC
CGGGCCCAGA TCCAGGAGCT GGTCTTTAAC TGCGAGCTGG CTGAACTTGG GCGGGAGATG
GGCTGCCAGA TGCTGGTGGA AGGACCGGGC CATGTGCCGC TGGACGAGAT CGAAGGCAAC
ATCAAACTGC AGAAGCGGAT GAGCGGCGAT GCCCCCTACT ATATGCTGGG GCCGATCCCC
ACCGACGTGG CCCCCGGCTT CGACCATATC ACCTCTGCCA TCGGCGCGGC CCAGTCCGCC
CGCTATGGCG CCGACCTGAT CTGCTACATC ACCCCGGCCG AGCACCTGGC CCTGCCCAAT
GAGCAGGATG TGCGCGAAGG GGTCAAGGCC GCCAAAATTG CCGCCTATAT CGGCGATATG
AATAAATACC CGGAGCGGAT GCGGGAGCGG GACAAGGCCA TGGCCAAGGC CCGGCGGGAC
CTGGACTGGC AGAAGCAGTT CGAACTGGCC CTTTTCCCGG AGGATGCCAA GGCGATCCGC
GCCAGCCGTA TCCCTGAGGA TGAGGCCACC TGCACCATGT GCGGCAACTT CTGCGCCTCC
CGCGGCGCAG GCAAGCTGTT TGCGGAGCAT CTGTGCGGGG ACAAGTGCTG A
 
Protein sequence
MQTQIEYARA GVITPQMQQV AATEGLAAEL IRQRVAAGTI VIPWNHNRKP SRIAGIGQGL 
RTKVNASIGT SSDIIDYAAE VRKALAAQES GADTLMELSV GGDLDRVRRE VIAAVELPVG
NVPLYQAFCD AARKYGDPNK LDPEELFDLI EQQCADGMAF MAVHCGINRC TVERLQKQGY
RYGGLVSKGG VSMVAWMLAN NRENPLFEQF DRVAAILKKY DTVLSLGNGL RAGAIHDSSD
RAQIQELVFN CELAELGREM GCQMLVEGPG HVPLDEIEGN IKLQKRMSGD APYYMLGPIP
TDVAPGFDHI TSAIGAAQSA RYGADLICYI TPAEHLALPN EQDVREGVKA AKIAAYIGDM
NKYPERMRER DKAMAKARRD LDWQKQFELA LFPEDAKAIR ASRIPEDEAT CTMCGNFCAS
RGAGKLFAEH LCGDKC