Gene Clim_1851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_1851 
Symbol 
ID6355192 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2031024 
End bp2032244 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content60% 
IMG OID642669455 
Productglycosyl transferase group 1 
Protein accessionYP_001943869 
Protein GI189347340 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAAC AACCCATAGC TTACCTCTGC AGCGAGTACC CGGCCATCTC CCACACCTTC 
ATCTACCGGG AAATCCAGTC GCTCCGCAAA GAAGGGTTCA CCGTGCACAC CGCATCCATC
CACAAACCCG GCGGTCTCGA CATCATGACC CCCGACGAAC AGGAAGAGGC CCGAAACACC
CTCATGGTGC TCGACCATTC GATACCGGCA ATCGCCGGAG CACACATCCG CTGCCTCGCC
GGCAACCCCA AAGGCTACCT CCGCATGGCC ACAGCAGCGC TCGGGCTACT CGTCTCGGGT
CCAAAAAGTC CGCTGAAAGC CATAGCCTAC TTCGCCGAAG CTGGCATCCT GCTCCAGTGG
ACGCGCCTGA ACGGCATCGC CCACATCCAC GAACACTTCG CCAACCCCAC AGCCATAGTC
ACCATGCTCA TGAAACAATA CGGCGGCATC ACCTACAGTA TCTCCGTGCA CGGCCCTGAC
ATATTCTACA CCGTCGACAC AGCCATGCTC CAGGAAAAAA TCAGGCAAGC CTCCTTCGTG
CGATGCATCA GCCACTATTG CCGCAGCCAG GTCATGCGCC TCAGCGACCC AGTCATCTGG
AACCGCTTCC ACATCGTACG CTGCGGCATT GACCCCGATC TCTACGCTCC GCGCCCCGAC
CCCGGCAACG CCGTTCCCCG GCTCCTCTGC GTCGGAAGGC TTGTACCTGC CAAAGGCCAG
CACATTCTGC TCGAAGCCTG CGCCATCCTT AAACGTCAAG GCACCCCCTT CCATCTCACG
TTGACCGGCG ACGGGCCTGA TCGCGCTTCG CTCGAACAGC ATTCCCGCAC ATGGGGTATT
CAGGAACTCG TCACCTTCAC TGGCGCGCTC GGACAGGACA ACGTCCGCCT GCTTTACGAC
CAGGCCGACA TCTTCGTGCT TGCAAGCTTC GCCGAAGGCG TTCCGGTAGT GCTCATGGAA
GCCATGGCCA AGGAAATACC CGTCATCTCC ACACGAATCA CCGGCATCCC CGAGCTCATC
GACCATCAGC ACGACGGCCT GCTTGCCATA CCCGGCGACC CCGTAGACCT CGCACTGCAG
CTCACCATGC TGCTTGCCGA CCCGACACTG CGCCGACAAT ACGGTAGGGT CGGCCGTCAG
AAAGTGATCG AACGATACAA TCAGCACCGA AACAACGCTC GACTCGGCGA ACACTTCAGG
AACCAGTACA GCAACCCATG A
 
Protein sequence
MKKQPIAYLC SEYPAISHTF IYREIQSLRK EGFTVHTASI HKPGGLDIMT PDEQEEARNT 
LMVLDHSIPA IAGAHIRCLA GNPKGYLRMA TAALGLLVSG PKSPLKAIAY FAEAGILLQW
TRLNGIAHIH EHFANPTAIV TMLMKQYGGI TYSISVHGPD IFYTVDTAML QEKIRQASFV
RCISHYCRSQ VMRLSDPVIW NRFHIVRCGI DPDLYAPRPD PGNAVPRLLC VGRLVPAKGQ
HILLEACAIL KRQGTPFHLT LTGDGPDRAS LEQHSRTWGI QELVTFTGAL GQDNVRLLYD
QADIFVLASF AEGVPVVLME AMAKEIPVIS TRITGIPELI DHQHDGLLAI PGDPVDLALQ
LTMLLADPTL RRQYGRVGRQ KVIERYNQHR NNARLGEHFR NQYSNP