Gene Cagg_0140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0140 
Symbol 
ID7266879 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp187392 
End bp188447 
Gene Length1056 bp 
Protein Length351 aa 
Translation table11 
GC content53% 
IMG OID643565012 
Producthypothetical protein 
Protein accessionYP_002461527 
Protein GI219847094 
COG category[S] Function unknown 
COG ID[COG0392] Predicted integral membrane protein 
TIGRFAM ID[TIGR00374] conserved hypothetical protein 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00167342 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGATC GGCCATCGGT GGTTGACATC ACGAACGATC CACCCGATTC CAACCAAACA 
GAGCGCAACG ATTTCTCGCT TGGTCAACGA TTGCGTCAAC CGCGTACTCT TATCTCGTTT
GGATTGGCAA TCGCGATTAT TGTATTTGTC GTGCGTGGTC TCGACATTGA CCTTGCGACC
ACGTGGCAAT ATATGCGTTC AGCCGATCTG TGGCTCCTTG GGTTAGGGCT GGTGGTTTTT
TATCTCACCT TCCCTTTACG CGCGCTCCGC TGGCGCATGT TACTGATCAA CGCCGGTGTA
CCGTTGCAGG CCGGGCGACA CAGTTGGGCT TCATTGCCGG CGTTAATCGA GTATATTTAT
CTGTCGTGGT TTGCCAATTG CATTGTACCG GCCAAGCTCG GCGATGCATA CCGTGGGTAT
CTGCTCAAAC ACAACGGTAA CGTCTCCTTT TCGGCCACCT TCGGCACGAT TTTTGCCGAA
CGGTTGCTTG ACATGATCGG CCTATTCAGC TTGCTCGTGA TCTCCGGGTA TCTTACCTTT
GGTGCGCATA TGCCCGAAGG GACGCAGATT GTATTTGGCT TTGGGGCACT CTTGGTCGTG
ATTATTATCA GTGGTCTGGC CGGTATGCGC TGGCTGGGGC CGCAGATACG CTGGTTTATT
CCACGCCGTT TGCATCGCGT CTACGGCAAT TTTGAGCAAT CGGCGCTGCG GTCGTTTACG
CCGGCAATCC TTCCCCGCTT ACTTGCCTTC ACCGGTGCTA TTTGGTTACT TGAGGGTTTT
CGGCTTTGGT TTGTTATTCA GTCCTTGAAT CATACCGGTC TGGATCTGAA CCTCGCCGCG
ATTATTTTTG TTGCGTTAGC ATCGTCATTG CTGACAGCAC TACCGATTAC CCCCGCCGGT
TTGGGAGTGG TCGAGGGAAC GATTACCGTC GTATTGACTC TCTTCGGCAT TGCAACGAGT
CTTGGTGGGG CAGTTACGCT GCTCGACCGG CTGATCAACT TCTGGAGTAT TGTCGTGTTT
GGTTTCATCC TCTATCTGTT CAGCCGGCGG AAGTAG
 
Protein sequence
MSDRPSVVDI TNDPPDSNQT ERNDFSLGQR LRQPRTLISF GLAIAIIVFV VRGLDIDLAT 
TWQYMRSADL WLLGLGLVVF YLTFPLRALR WRMLLINAGV PLQAGRHSWA SLPALIEYIY
LSWFANCIVP AKLGDAYRGY LLKHNGNVSF SATFGTIFAE RLLDMIGLFS LLVISGYLTF
GAHMPEGTQI VFGFGALLVV IIISGLAGMR WLGPQIRWFI PRRLHRVYGN FEQSALRSFT
PAILPRLLAF TGAIWLLEGF RLWFVIQSLN HTGLDLNLAA IIFVALASSL LTALPITPAG
LGVVEGTITV VLTLFGIATS LGGAVTLLDR LINFWSIVVF GFILYLFSRR K