Gene Cagg_3056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_3056 
Symbol 
ID7269473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp3716542 
End bp3718701 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content54% 
IMG OID643567876 
Producthypothetical protein 
Protein accessionYP_002464350 
Protein GI219849917 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000115071 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAACGATG CTGTCACCGT CAACTCATCC AACTGGTGGA ACCAACCACT CACGGCAACA 
GGCTGGCTAA ACCATCTCCG CCGCTACGAA GCGGTGTTGC TATTCGCAAG CCTCGCGCTT
TACCTCTATA CCCGTTTCAC CGATCTGAGT GCATTCCCGA TCTATTTTTT TTGTGATGAA
GCGATTCATG GCGTTTTAGC CCGTGATCTG GTCGCCAACG GGTTTCGGGA TAGCAATGGA
GTGTGGTTGC CACCGTACTT TCGCAACGCC GAAAAGTGGA GCCTTAGCCT GAGCGTTTAT
GTTCATGCAA TAAGTATACT TTTCTTTGGG TTTGACCGCT CGGTCTGGAT GACCCGCGCT
CCATCGGTTA TCGTCGGTCT GCTGGCCCCT ATCGGTATTG CCGCACTCCT TCGCTTTGGC
TACCACAACC GGTATTGGTG GCTCGGTATT CTCGCCCTCT GTGATATGCC GGCGTGGTTT
CTCCATTCAC GCACCGCGTT TGAAACGGTG CTGATGGTTG CGTTCTATGC ATGCTTTCTC
GCAGCATACG TGGCCTATCG CCATTACGAC GACCGCTGGA TTGTGGCGGT GATTTTGTTT
GGAGCGGCCA CCTTCTACAG TTACGCCAAC GGCCAGGGCG TGATGTTGAT CAGTAGCCTG
CTGCTGTTGA TCAGCGATGC TCGGTACCAT GTTAGCCGAC CACCACAACG CCTGTGGATG
GCACTTGCGC TGTTCGCCGT TGTACTGATA CCACTGGTCC GTTTTCGCCA ACTCGAACCT
GACGCCACAG TTGAACATCT TCGTACCCTC AATTCGTACT GGTTCCAACC GATGCCGCTT
GGCGAAAAGA TCCAACGTTT TCTTACGTTG TACGGTCAAG GACTCGATCC ACGTTATTGG
TTTTGGCCCA ATACTATCGA TCTCGACCGG CATCGCTTTC CTGATCGCGG CCATATACCG
CTGCTGTTGT TGCCGTTCAT CACCATCGGA CTAGGAGTGT GTCTGTGGCG ATGGTGCGAT
TCACGCTACC GGCTGCCTAT CATTGCCGTA CTTGCGGCAC CCTTCAGTAG CGCGCTGGTC
GGTATTGCTG TAACCCGAAC CCTGGCAATG GTGGTACCGG CGGCTCTGTT GAGTGTCATC
GGACTGAAGG CGGTGATCGA ACGCCTCGTA CCATATCGGC AACGGCTTGT GGCGCTCACG
TTGGGGCTTG TTCTCGCCCT CGACAGTCTC ATCCTGCTGC GAACTGCCGT GCTCGGTGGA
CCACTTTGGT TCCGCGACTA CGGCTTGTAC GGTATGCAAT ACGGCGCACC TCAGATCTTC
GGCGAGACGG TACCGATGCT GTTGGCGCGT GACCCCCAAG CCATAGTACG GGTATCGCCG
GTGTGGGCGA ACAATCCGAA CAGTTTTGTT GATTTTTTTC TCGATCCGGC ACAACGCCGA
CGTGTTAGCC TGACCAGCAT CGATGCCTAT CTGTACTATC GCGGTGATCT TGATCGACAA
CGCGATATTT TTATTATGAC CGCCACCGAG TATCAGCAAG CACAAGAAAG CGGCAAGTTT
ATTATTGAAC CGCCTGAACA GATCATTCCA TACCCTGATG GCCAACCCGG TTTTTACGTG
GTTCGGCTCA GCTATGTCGC GAACATCGAC GAACTACTAG CAGCCGAACG CGCGACACGC
GCCGCGTTGA TTGAAGAGCC GTTCGTCATT GCCGGACAAG AGTTGGTGGT CGCGCATTCC
CGCTTTGATA TAGGGAACGT CGCCGATCTC TTCGATGGCA ATCCGGCTAC GTTGGCACGC
GGGTTTGAAG CAAACCCACT GGTGATTGAG TTGCGCTTTA CCAACCCCCA ACCGGTGAAC
GAGATCGAAT TGACCTTGGG GACGATGGAT TTCAACTTGA TCGTCACCAT CATAACACCA
GAGGGAGCAA CAACAACTAT CAGCCAACCG TATCGCAACT TACCACCCGA CCCAACGGTA
TCCCTCGCTC TCCCTCAAAC AGTAATGGCA GCACAAGTAC GTTTCGCTAT CTTGCAAGTG
GGGATCGGCG AGCCAGCTCA CATCCACGTG CGTGAAGTGC GCTGGCGGGA AGGCGATTCA
TCAGTCACTT GCACCCACTG CATCGGCAAT CAGCTTAGCC CGACGCAGAT CGGTTACTAA
 
Protein sequence
MNDAVTVNSS NWWNQPLTAT GWLNHLRRYE AVLLFASLAL YLYTRFTDLS AFPIYFFCDE 
AIHGVLARDL VANGFRDSNG VWLPPYFRNA EKWSLSLSVY VHAISILFFG FDRSVWMTRA
PSVIVGLLAP IGIAALLRFG YHNRYWWLGI LALCDMPAWF LHSRTAFETV LMVAFYACFL
AAYVAYRHYD DRWIVAVILF GAATFYSYAN GQGVMLISSL LLLISDARYH VSRPPQRLWM
ALALFAVVLI PLVRFRQLEP DATVEHLRTL NSYWFQPMPL GEKIQRFLTL YGQGLDPRYW
FWPNTIDLDR HRFPDRGHIP LLLLPFITIG LGVCLWRWCD SRYRLPIIAV LAAPFSSALV
GIAVTRTLAM VVPAALLSVI GLKAVIERLV PYRQRLVALT LGLVLALDSL ILLRTAVLGG
PLWFRDYGLY GMQYGAPQIF GETVPMLLAR DPQAIVRVSP VWANNPNSFV DFFLDPAQRR
RVSLTSIDAY LYYRGDLDRQ RDIFIMTATE YQQAQESGKF IIEPPEQIIP YPDGQPGFYV
VRLSYVANID ELLAAERATR AALIEEPFVI AGQELVVAHS RFDIGNVADL FDGNPATLAR
GFEANPLVIE LRFTNPQPVN EIELTLGTMD FNLIVTIITP EGATTTISQP YRNLPPDPTV
SLALPQTVMA AQVRFAILQV GIGEPAHIHV REVRWREGDS SVTCTHCIGN QLSPTQIGY