Gene Cagg_1939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_1939 
Symbol 
ID7268855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp2373839 
End bp2375005 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content57% 
IMG OID643566777 
Productaminotransferase class V 
Protein accessionYP_002463270 
Protein GI219848837 
COG category[E] Amino acid transport and metabolism 
COG ID[COG1104] Cysteine sulfinate desulfinase/cysteine desulfurase and related enzymes 
TIGRFAM ID[TIGR03402] cysteine desulfurase NifS 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00024706 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTGAGC ACCTCCTGTA TTTCGACTAT GCCGCAACGA CACCGCTCGA CCCGTGCGTG 
CTCGAGGCAA TGATGCCGTT TTTGAGCGGG ATGAGTGGAA ATGCCTCGAG TATCCATCAA
GTAGGGCGAG CGGCATTGCA AGCGCTCGAC GATGCCCGTG AACAAGTAGC GTTCGTCTTG
GGCTGCCAAC CGAAGGAGAT TGTCTTTACG GGTGGCGGCA GCGAGAGCAT TAATCTGGCG
ATCAAGGGTG TCGCTATGGC ACTACGTGCG CACGGGAAGA CGCACGTGAT CAGTAGTGCC
GTTGAACATC ACGCTGTCTT GCACGCGATC GATTACCTTG TGGAGTATGA AGGGTTTCGT
GCGACCCTTC TCCCCGTCGA CCGGAGTGGC CGGGTCAACC CTGCCGATCT AAGCGCTGCT
ATCCGTCACG AAACGGCATT AGTATCGGTG ATGTACGCTA ATAACGAGAC GGGAGTGATC
CAGCCGATTG CCGAACTGGC CGCCATCTGT CACGAGCATG GGGTGTTATT CCACACCGAT
GCGGTGCAGG CCCCCGGCCA ATTGCCGCTG GATGTGCAAG CGCTTGGTGT TGATTTACTC
AGTCTGACAG CCCATAAGTT CTACGGTCCA CAGGGAGTGG GAGTACTCTA CATGCGCCGT
GGCACACCAC TGGTGCCGCA GATCAACGGT GGTGCTCAAG AACGCCGACG GCGAGCCGGG
ACAGAGAACA TTGCCGGCAT TGTCGGATTA GCAACGGCGT TAACGATTGC CGAGCGTGAG
CGAGCACAGT ATGTCCATCA GTTACGTACA CTCAGTGAGC GACTGATCGG TGGCGTATTG
CAGCGTATTC CGCACTCGTG GCTCAACGGC GACCGAACCT GTCGATTGCC GAGTATCGTC
AATCTCGGCT TTGCGTGTAT TGAAACCGAG AGTCTCTTGC TCTTACTCGA CCAGCGCGGC
ATCTGCGCCA GCTCAGGCAG CGCCTGTACA TCTGGTTCAC TGGAACCCTC GCACGTCCTG
TTGGCGATGG GACTTTCGCC CGAAGAAGCG AATGGCTCGA TCCGATTTTC GCTAGGTCGT
CATACCACCG CCGAACAGAT CGAGACCTTG CTCGAACTGT TGCCCGATCT TGTCGCGCAA
TTGCGTGCAG TAGCACCGTG TGCGTGA
 
Protein sequence
MPEHLLYFDY AATTPLDPCV LEAMMPFLSG MSGNASSIHQ VGRAALQALD DAREQVAFVL 
GCQPKEIVFT GGGSESINLA IKGVAMALRA HGKTHVISSA VEHHAVLHAI DYLVEYEGFR
ATLLPVDRSG RVNPADLSAA IRHETALVSV MYANNETGVI QPIAELAAIC HEHGVLFHTD
AVQAPGQLPL DVQALGVDLL SLTAHKFYGP QGVGVLYMRR GTPLVPQING GAQERRRRAG
TENIAGIVGL ATALTIAERE RAQYVHQLRT LSERLIGGVL QRIPHSWLNG DRTCRLPSIV
NLGFACIETE SLLLLLDQRG ICASSGSACT SGSLEPSHVL LAMGLSPEEA NGSIRFSLGR
HTTAEQIETL LELLPDLVAQ LRAVAPCA