Gene Cagg_0477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCagg_0477 
Symbol 
ID7266645 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChloroflexus aggregans DSM 9485 
KingdomBacteria 
Replicon accessionNC_011831 
Strand
Start bp587149 
End bp588255 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content58% 
IMG OID643565340 
ProductNHL repeat containing protein 
Protein accessionYP_002461854 
Protein GI219847421 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.517472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.232991 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAACA CAGCTTATCG CTGGTTAGGA TCACCGGCTC CGGGAGGGTT AGTCCTACCG 
GCAGCCAACC CAACACCGTC CCATCTCTAT GCCCCGCGCG GTGTTTATCT CGACGACGAG
CGACTGATCG TGGCCGATTC AGGGAATCAC CGTGTCTTAA TCTGGCATGG ATTTCCGGCG
ATTGACCATC AGCCTGCCGA TCTGGTACTC GGTCAACCAG ATTTCTTCCA CGAAGGGCCG
CGTGCTACCG GTCGCGGCCC TGATCATGGC TTTCAGCTCC CTACCGGAAT AACAGTGGCC
GAAGGCCGCC TCTACCTTGC CGATGCGTGG CATCATCGTG TATTATGTTG GCATCGCATC
CCCGATCGAT CCGGCACACC ACCCGATAGT GTGATCGGGC AAGATTCGCT ACTTGACATC
GAGCCAAATC GTGGCGGTAC GGTTGGGCCG CACACTCTCT ACTGGCCGTA TGGTGTAGCC
TGGATCAATG GCTGGTTTTA TATTGCCGAT ACCGGCAATC GACGGGTGTT AGGGTGGCGT
GGTCTTCCCA GCGATAGACA ACCGCCTGAT GTGATACTCG GTCAGCCTGA TGCCTACAGC
AACGCCGAGA ATCGTGGTGG CCCACCGACG GCAAATAGTT TTCGTTGGCC ACACGCCATT
GCCGGCGATG GCGAGACGTT GTACGTAGCC GATGCCGGTA ACCATCGGGT ATTGGGATGG
ACACCGCCAC CGGAAAGCGA TCGACCTGCC GATCTCGTGT TAGGACAACA CACCATGCAT
AGCGCATTTG AACAACCACA TGTGCCACAA GGGGCCTATC GGCTACGCTT TCCCTACGCC
GTTGCCTGCA ACAGCCATCG GTTATTTGTG GCCGATACGG CCAACAACCG GATTCTCGGC
TGGCGACCAC CGCCGCGTGT TGGCGCCGGC ATCCCGGCGC AGACGGTCTA CGGCCAGTAC
AATTTCGACG ACAGTGGCGA GAATCGCTGG CAAGCTGTCG CTGCCAATAC ATGTTGCTGG
CCTTACGGTC TCTGGCTGCA TCGGCACTGG CTGGTCGTTG CCGACTCCGG TAACAATCGC
GTCCTCATTT GGCATACTGA AAACTGA
 
Protein sequence
MMNTAYRWLG SPAPGGLVLP AANPTPSHLY APRGVYLDDE RLIVADSGNH RVLIWHGFPA 
IDHQPADLVL GQPDFFHEGP RATGRGPDHG FQLPTGITVA EGRLYLADAW HHRVLCWHRI
PDRSGTPPDS VIGQDSLLDI EPNRGGTVGP HTLYWPYGVA WINGWFYIAD TGNRRVLGWR
GLPSDRQPPD VILGQPDAYS NAENRGGPPT ANSFRWPHAI AGDGETLYVA DAGNHRVLGW
TPPPESDRPA DLVLGQHTMH SAFEQPHVPQ GAYRLRFPYA VACNSHRLFV ADTANNRILG
WRPPPRVGAG IPAQTVYGQY NFDDSGENRW QAVAANTCCW PYGLWLHRHW LVVADSGNNR
VLIWHTEN