Gene Cpha266_2363 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_2363 
Symbol 
ID4569621 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp2748129 
End bp2749325 
Gene Length1197 bp 
Protein Length398 aa 
Translation table11 
GC content49% 
IMG OID639766921 
Productaminotransferase, class V 
Protein accessionYP_912775 
Protein GI119358131 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.197402 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGTTT ATTTTGATAA CAATGCCACA ACGCCGCTTC ATCCTGAAGT AAAAAAAGAG 
CTGACGGCTG CAATGGAAAT GTTTGGCAAT CCGTCGAGCA TGCACTCCTA TGGTCGTGAA
GCGAAGGCAA ATGTCGAGGA TGCCAGAATT CGGGTTGCCA GGTTTATCGG GGCTCATGAA
AACGAGATCG TTTTTGTCGG AAGCGGTTCA GAGGCAAACA ATACCGTGCT TTCGCTTTTT
GTCTGCTCAT CCAGGCAGTG TATTCCCGGA CTGAAAATGC GCAACACCAT TATTACGACG
AAAATCGAGC ATCCCTGCGT GCTTGAGACA TCCGATTGTC TTGCGCATCG AGGGGTGAAG
GTGAAGTATC TTGATGTTGA TCAGTACGGT AAAATTTATC TCGATCAGCT TGACTCGATG
CTCGACGACA GTGTCGGTCT GGTTTCAGTC ATGATGGCAA ATAATGAAAT TGGTACCATG
CAGGATATTG CGGCCATTAC CGAAATGGTG CATGAACGCG GCGCGTATAT GCATACCGAT
GCCGTTCAGG CGGTTGGCAA AGTTCCTGTT GACGTCGGGG CGTTTGGTGT CGATTTTCTT
ACTATTTCAG GGCATAAAAT ATATGGGCCG AAAGGAGTCG GTGCTCTTTA TGTGAAAAAC
GGCATACCAT ACTGCCCGTT TATCAGAGGG GGCCATCAGG AGAAGGGCAG GCGAGCAGGA
ACCGAGAACA CTCTCGGCAT TATGGGTCTT GCCAGGGCTG TTGATATGCG AGTGCTTGAA
ATGGACGATG AACACCAGAG GCTGCTTGTT CTTAAGCAGG CGTTGCGAAA GGGAATAGAG
GAGCGTATTG ATGATATTTA TTTCAACGGT CATCCAACGG ATTCACTTGC CGGCACGCTC
AACGTCTCGT TTCCCGGAGC AGAGGGTGAA GCTATTCTTC TCTATCTTGA TCTTGAGGGG
ATAGCGGTAT CAACCGGGTC AGCCTGTGCA TCAGGCTCAC TTGATCCCTC GCATGTGCTG
CTTGCGACCG GTGTTGACGC CGAAAGAGCG CACGGATCCA TCCGGCTGAG TCTCGGAAGA
GAGAGCACGA TGGAAGAGGT TGATTATGTG CTTGATGTTT TACCACGAAC TATCGAACGA
ATCAGAAACA TGTCTACGGC ATACATAAAA GGAGGACTCC ATGCTACAAG CAGGTGA
 
Protein sequence
MKVYFDNNAT TPLHPEVKKE LTAAMEMFGN PSSMHSYGRE AKANVEDARI RVARFIGAHE 
NEIVFVGSGS EANNTVLSLF VCSSRQCIPG LKMRNTIITT KIEHPCVLET SDCLAHRGVK
VKYLDVDQYG KIYLDQLDSM LDDSVGLVSV MMANNEIGTM QDIAAITEMV HERGAYMHTD
AVQAVGKVPV DVGAFGVDFL TISGHKIYGP KGVGALYVKN GIPYCPFIRG GHQEKGRRAG
TENTLGIMGL ARAVDMRVLE MDDEHQRLLV LKQALRKGIE ERIDDIYFNG HPTDSLAGTL
NVSFPGAEGE AILLYLDLEG IAVSTGSACA SGSLDPSHVL LATGVDAERA HGSIRLSLGR
ESTMEEVDYV LDVLPRTIER IRNMSTAYIK GGLHATSR