Gene Cagg_0704 details


Gene Information

Locus tag: Cagg_0704
Symbol:
ID: 7266956
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 871823
End bp: 873091
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 54%
IMG OID: 643565555
Product: protein of unknown function DUF1501
Protein accession: YP_002462064
Protein GI: 219847631
COG category: [S] Function unknown
COG ID: [COG4102] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
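
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record above.
start_bp = 871823
end_bp = 873091

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
assert gene_length % 3 == 0

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed Gene Length (1269 bp) and Protein Length (422 aa).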


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 39
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCTGCGTG AGCGGCACAG ATGGATAAAG TCGAACTTAC GTGGTGGTTG CGATGGACTT 
AGCCTGCTTA GCCCCTACGA TGATACCTAC TACCGTTCGG CAAGGGGAAC GTTGGCCTTG
CCATTAAGCG GCCCCAATGC ACCGTTGCGG ATCGATACTA ACAATCCCTC ATACAACACG
AACAGCTTCG GCTTCAATAG CAAGATGCCG CATCTGCGCG ATCTTTACAA CAGCGGCCAT
CTGGCTCTCA TCCATGCCTG CGGGTTAGAT GACGACACCC GCAGCCATTT TGATGCGATG
GACTACATCG AGCGCGGGAC ACCGGGCAAT AAAACCACAA ACAGCGGATG GCTCACCCGT
CACTTGCAGT CGCAGGGAGG AACCAGCAGT TTACTACCGG CAGTCGCAGC AAATACCGCC
GTGCCAGCCT CGCTGCTCAA TCACCCGCCG GCAATTGCAC TGTCGTCGCC GAGCAGTTTC
ACGGTGAGCA CGCATTGGCG CTACAATCGC GAACAAGATA ATTTTCCGTT CCTGACTACG
CTGCGCGAAA TGTACAACCG CAGTACGATC TATCCATTGG CGTCGGCCGG TCGGCGAGTG
ACGCAGGTGC TTGATCTGAT GCGTACTATG GGGAGCTATA CCCCAGCTTC GAGTATCGCT
TATCCTTCTG GTACATTTGG CGATGCGTTG AAGACGGTGG CTCAGTTGAT CAAAGCTGAG
ATTGGCCTGC AAATAGCGAC GATTGATTTC GGTGGCTGGG ATACCCACGA AGCACAGGCA
AACAGCGATG GTGGTGGCTA CTTACCCGAT CGACTCGGTG TGCTTTCGCA GGGATTGGGC
GCGTTTTACA ATGACCTCGC AGCATATCAC AACCGCTTGA CCATCGTTGT TCTGAGCGAA
TTTGGTCGTC GGTTGGGACG TAACCGGTCG AACGGTACCG ATCACGGCCA TGGTAATATG
ATGATGGTAC TGGGCGGCAA TGTGAACGGA CGCAAAGTGT ATGGTACGTG GCCGGGGTTA
CATCCCGATC AGCTTGATAA ACGGCAAGAT TTGCAGATTA CAACTGACTT CCGACAGGTG
CTGAGTGAGA TTTTGATTCG CCGATTGGGT AACCCGCTGC TTGGAGTGAT CTTCCCGGGC
TTGACATCGT ACACACCGCT AGGAATTGTG CGCGGGACCG ATTTACCACC GGTACTCTCT
GCTGACACGG TAGCACCTGC CAATACGGAG CATCGTATTT TCGTACCGGT GATTCAGCAA
TGTCGGTAG
 
Protein sequence
MLRERHRWIK SNLRGGCDGL SLLSPYDDTY YRSARGTLAL PLSGPNAPLR IDTNNPSYNT 
NSFGFNSKMP HLRDLYNSGH LALIHACGLD DDTRSHFDAM DYIERGTPGN KTTNSGWLTR
HLQSQGGTSS LLPAVAANTA VPASLLNHPP AIALSSPSSF TVSTHWRYNR EQDNFPFLTT
LREMYNRSTI YPLASAGRRV TQVLDLMRTM GSYTPASSIA YPSGTFGDAL KTVAQLIKAE
IGLQIATIDF GGWDTHEAQA NSDGGGYLPD RLGVLSQGLG AFYNDLAAYH NRLTIVVLSE
FGRRLGRNRS NGTDHGHGNM MMVLGGNVNG RKVYGTWPGL HPDQLDKRQD LQITTDFRQV
LSEILIRRLG NPLLGVIFPG LTSYTPLGIV RGTDLPPVLS ADTVAPANTE HRIFVPVIQQ
CR
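
The protein sequence corresponds to the gene sequence translated codon by codon. As a minimal sketch, the Python below translates the first and last blocks of the gene sequence and compares them with the ends of the protein sequence. It assumes a forward-strand reading frame starting at the first base, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code only in which codons may start translation). The names `CODON_TABLE` and `translate` are illustrative helpers, not part of the record.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate DNA in frame 0, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks.
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks and the final line of the gene sequence.
first_block = translate("ATGCTGCGTG AGCGGCACAG ATGGATAAAG")
last_block = translate("TGTCGGTAG")

print(first_block)  # MLRERHRWIK
print(last_block)   # CR
```

The two results match the first ten residues (MLRERHRWIK) and the last two residues (CR) of the protein sequence; the terminal TAG is a stop codon and contributes no residue.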