Gene Cagg_3591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3591 
Symbol 
ID7269735 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp4367854 
End bp4369836 
Gene Length1983 bp 
Protein Length660 aa 
Translation table11 
GC content50% 
IMG OID643568399 
Producthypothetical protein 
Protein accessionYP_002464865 
Protein GI219850432 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00100041 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCGTTATC ATTTCTCGCG TATACAGCTT CTGCCATTGG CAATGCTCCT TGCACTGCTC 
GTCATTTCCC TAAGCGCAAC ATGGATCTTA GCATCACGTC CATGGCGGAT TGATGCAGTC
ATTGGCGGCG CTGACTCGGC TCTTGTTGGT TCGGGCTTTT TTACCAAAGA GCTGTCTTCT
GATGGTACGC CATTTCGCTG GACGAGCGGG CCTGCCATTA TCAATCTCCC ACCGGTGCAT
GCACGGTATA TTGTTACAAT GCGCGCCTAC GTTCCCAGTG ATGTGATTCC TTATTACGTT
GAGATCAAAG ACCGTGCCTT TCCAGTGGCC ACAATCGTGG TTACAGATCA GCTCCCCGCC
TTCCGCAGAT ACCATATCCT TTGGCAGTCA CCTGTAACCT ATCACTGGTT AGATTTGTTT
ACACCACGTC GTTTCACCAT TGATGCTGAA ACGCAACATC GTAACCAAGA TGACCCTCGG
TTGTTGGGGA TTGCAGTCAG CCAGCTGCAT ATTCGCAGTT CCAACACATT GGCAGTGCCG
GTGATGCCAC TTATTACCGT TGGGTTAACT CTGCTTGGCT TTGCTCATCT GCTCTGGCCG
TTGCGCGGCA AACGGCTTGT GTGGTTTGCT GTGGTTGCTC TCATCTTGCC AGTAGGATAC
GACCTTCTCG TATGGCACCC GCTTCAGGGA AATGATTACA CGTGGCTACC ATTATCGTGG
TTGCCGGGCA TGGTTGCGGC ATCGGTCATA GGTGTTGCAT TCGCTCAGCG TGCTGCTCTA
TCACGTGGGG GAGCGTGGTT CGCAGCTTTG ATTGTCATCT TATTGATGGT GGCCGTCATC
ACCACTTTGC AATGGCACTG GCTGGTTGAA GGACCTGATT ACCATTGGCA TCTGAACCAT
GGCGGTTCTT GGCGCCGTGT GTTCCGCTCC CACCCTTTCT ACCCGTTTGG CTTGCCATTG
ATTTTGTACG TAGGACAACT GGCTGGTGAC CAAGCACTGT TATTTGGACG TATTGCGGGG
GCTGTCACTA CGTCTGTAGC TATCGTAGCT GTTGTGCTAT TAGTATGGCG GGTAATCGCG
CCCGCATATG CATGGGTAGC GGGCATGATC ATGCTGGCAT CACCGGTCGT GGTATCTCAT
GGGGCTTTGG CGAGCACTGA CGCACCTATG ACCGGTTTGG CAACATTGGC ATTGCTTGCT
CTCCTGTGGC ACGAGCGGCT CCGTTGGCTT CAGATTGCAT TGGCCGGCAT GTGTCTTGGT
TTAGCGTATC TCTTCCGCGT TCAAGTGACA ATGTTGCTCA TCTCCTCCTT GCTGTGGTTG
TACTGGCAGT CGACTCCGGC ATTGCTGTCG CCGTCAACTC GACCGTTCGA TCCTCGTCGT
TTGATAGGGC CGCTGGTCTG TCTGGGAGGA TTTTTGCTGA CGTCAGCACC ACAGTGGATA
TTAGACATTC GGGATACTGG TTTTCCTTTT GTGACAAAGC AATACGTCAA TATTTGGGCA
TTCGCTTTTA GCCATTCTAA CCCGTTGCCT GACGGTTCAA CCTTTGAACA ACTGTGGTTT
ATTCTAACGT TTGATCCGTA CGCACTATGG CGTCATTGGC TCAGCAATAT CATCCAATTT
AGTACAGATA CGATTCATCG CTTATTTGTA TGGCCATTTG GTTTTATTGC ATTTGGTGGA
TTGATACTAA AAGGTGCTAT CTCTATCCGC CGTTACTGGC TTCTGCTGGT TTGGGTTGTA
GTTTATGTTT TATTTGTGAT GCTTACTGCA AATAAAGAAC GGTTCTTCTT GCCTGTCGTG
CCAGCATTGG CTGTTTTTGC TACTGCATTT CTTGCAGAGA TACATAATCG TGTTAGCCAA
TGGGGAAGGC GATTTGTGTT CCTACCGATA TTGGTGAATG CGCTGCTAAT GTACTGGATT
ATGATCCATC TTACGTTAGC AGAAGTTGAA TTGGCCGGTT ATGGATTTAC GCGGAATTGG
TAG
 
Protein sequence
MRYHFSRIQL LPLAMLLALL VISLSATWIL ASRPWRIDAV IGGADSALVG SGFFTKELSS 
DGTPFRWTSG PAIINLPPVH ARYIVTMRAY VPSDVIPYYV EIKDRAFPVA TIVVTDQLPA
FRRYHILWQS PVTYHWLDLF TPRRFTIDAE TQHRNQDDPR LLGIAVSQLH IRSSNTLAVP
VMPLITVGLT LLGFAHLLWP LRGKRLVWFA VVALILPVGY DLLVWHPLQG NDYTWLPLSW
LPGMVAASVI GVAFAQRAAL SRGGAWFAAL IVILLMVAVI TTLQWHWLVE GPDYHWHLNH
GGSWRRVFRS HPFYPFGLPL ILYVGQLAGD QALLFGRIAG AVTTSVAIVA VVLLVWRVIA
PAYAWVAGMI MLASPVVVSH GALASTDAPM TGLATLALLA LLWHERLRWL QIALAGMCLG
LAYLFRVQVT MLLISSLLWL YWQSTPALLS PSTRPFDPRR LIGPLVCLGG FLLTSAPQWI
LDIRDTGFPF VTKQYVNIWA FAFSHSNPLP DGSTFEQLWF ILTFDPYALW RHWLSNIIQF
STDTIHRLFV WPFGFIAFGG LILKGAISIR RYWLLLVWVV VYVLFVMLTA NKERFFLPVV
PALAVFATAF LAEIHNRVSQ WGRRFVFLPI LVNALLMYWI MIHLTLAEVE LAGYGFTRNW