Gene Clim_2175 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2175 
Symbol 
ID6355969 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2412896 
End bp2414092 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content52% 
IMG OID642669766 
Productaminotransferase class V 
Protein accessionYP_001944178 
Protein GI189347649 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.670004 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTT ATTTTGATAA TAATGCAACC ACTCCTCTGC ATCCCGAAGT TAAAAAGGAG 
TTGATCGAAG CGATGGGGAT GTTCGGTAAC CCTTCGAGTA TGCATGCCTG GGGCCGCGAG
GCTCGGGCCA ATGTCGAGGA TGCCCGGAGT CGGGTGGCCG GTTTTATTGG AGCGCATGAC
GATGAGATTG TTTTTGTCGG CAGTGGTTCC GAAGCCAATA ATACCGTGCT CTCTCTTTTT
GTCTGCGCCT CAAACCAGTG TATTCCCGGT ACCAGGATGC GCAGTTCCAT TATTACGACG
AAAATTGAAC ACCCCTGTGT GCTTGAAACC TCGGAATGTC TCGCTCACCG GGGGGCAAGG
GTAAAGTATC TCAATGTTGA CCGTTACGGA AAAGTCGATC TCGATCAGCT TGCCGGTATG
CTTGGAGATG ATGTCGGTCT TGTTTCGGTT ATGATGGCGA ATAATGAGAT CGGTACGCTG
CAGGATATTG AAACCATATC GAAAATGGTG CATGAGTGCG GTGCGCTGAT GCACACGGAT
GCTGTTCAGG CGGTCGGAAA GATTCCGGTT GACGTCGCCA TGCTCGGGGT GGATTTTCTT
ACGCTTTCGG CTCATAAAAT ATATGGACCG AAAGGGGTTG GAGCTCTCTA TGTGAAAAAA
GGCATTCCTT ACTGTCCGTT CATCCGCGGA GGTCATCAGG AGAGAGGTCG TCGGGCGGGA
ACTGAAAATA CGCTTGGCAT TCTTGGTCTC GGAAAGGCCG TCGAAATGCG ACAGCTCGAA
ATGGAGTCTG AAGAAAAGCG ACTGGCCGGG ATGAAAGCGG TTCTTAAAAA AGGCATTGAA
GAGCGGATCG ACGATATTTA TTTCAACGGG CACCCGACCG ACTCCCTTTC GGGAACCCTG
AACGTTTCGT TTCCCGGAGC TGAGGGAGAG GCGATTCTGC TCTATCTCGA TCTTGAAGGC
ATTGCGGTTT CAACCGGGTC GGCCTGCGCC TCAGGATCTC TCGACCCGTC TCATGTACTG
CTGGCAACGG GAGTCGATGC AGAGCGAGCG CACGGATCCA TCCGTATCAG CCTCGGGCGG
GAAAGCACCA TGCAGGAGGT CGAGTACATG CTCGATATAC TGCCTAAAAC AATTAAACGG
ATAAGAGACA TGTCAACGGC ATACATTAAA GGAGGAACAC ATGCTGCAAG CAGGTGA
 
Protein sequence
MKVYFDNNAT TPLHPEVKKE LIEAMGMFGN PSSMHAWGRE ARANVEDARS RVAGFIGAHD 
DEIVFVGSGS EANNTVLSLF VCASNQCIPG TRMRSSIITT KIEHPCVLET SECLAHRGAR
VKYLNVDRYG KVDLDQLAGM LGDDVGLVSV MMANNEIGTL QDIETISKMV HECGALMHTD
AVQAVGKIPV DVAMLGVDFL TLSAHKIYGP KGVGALYVKK GIPYCPFIRG GHQERGRRAG
TENTLGILGL GKAVEMRQLE MESEEKRLAG MKAVLKKGIE ERIDDIYFNG HPTDSLSGTL
NVSFPGAEGE AILLYLDLEG IAVSTGSACA SGSLDPSHVL LATGVDAERA HGSIRISLGR
ESTMQEVEYM LDILPKTIKR IRDMSTAYIK GGTHAASR