Gene Cagg_2440 details

Gene Information

Locus tag: Cagg_2440
Symbol:
ID: 7266163
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 2960469
End bp: 2961719
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 58%
IMG OID: 643567266
Product: protein of unknown function DUF214
Protein accession: YP_002463749
Protein GI: 219849316
COG category: [V] Defense mechanisms
COG ID: [COG0577] ABC-type antimicrobial peptide transport system, permease component
TIGRFAM ID:
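
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this explicitly, but it matches the reported 1251 bp) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates (an assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(2960469, 2961719)
    print(length, protein_length_aa(length))
```

With the values in this record, the sketch gives 1251 bp and 416 aa, in agreement with the Gene Length and Protein Length fields.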


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.652514
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 29
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTCTCTG AATTGATCGC TATGGCCTTT GATAGCCTAC GCGCCAATAA ATTTCGCTCT 
CTCCTCACAA TGCTTGGCGT GATCATCGGG GCAGGCACGC TCGTCGCAGT GCTCTCGCTC
GGTAACGCCT TACAAGGCCA AGTCTTCGAG CAGTTCGTCG ATCTCGGCAC TCGCCGGATT
GCAATCACGC CGGGCGATCC GCGGGCCAAA GGCGCACGTG ATGTGCCCGG ATACGGTCTG
CTCTCGGTGC AGGATTATCA GGTATTAGCA GAAATGGCAA CCAGTCGGCC TGATCTCTTT
CGTGCTATTG CGCCGGAAAT TACGGTCAGC ACGCAAGCCC GGGCCGGTAC GGTGGCGATC
CAAACGTTGC TGGTCGGCAC GACCGAAAAT TATCCGCAAG TCCAGCGCAC GCCGATGCTC
TACGGGCGAT TTTTGACCCC TGAAGATGAG GCAGAAGGGG CGCGGGTTGG GGTGTTGGGC
TGGCTTGTGG CCCGTGATCT GTTTGGCTCT GATAAGGCTG CGCTGCGGAA TGTGATCGGG
CAGACGATAG AGGTTAACGG TCAACCGATT GAGATTGTCG GCATCATTAA CGAAAACGGT
GGGCCTTTCT CGACCGATGG CCGCATCTTT ACGCCGGTCA GTACGATGCG CTTACGGCTG
ATCGGTGATC TTGATCTACC GGGACGTGGT TTGCGGATGA GCAGTATTCT GCTCGGCTTG
CAAAGCGAAC AGCAGGTCAA TGAGGCGGTA GCCTTGATTG AACGGACGCT GCGTGCCGCG
CGGAATGTGC CTGATGGCGT GATCAACGAT TTCAACCTAC AGACGCCGAC GCAAGCATTG
AACGTATTGG CGGCGATCAG CACGGCGATC ACCGGCTTTA TTGCGGTGGT TGCCGGGATT
AGTCTGGTGG TTGGTGGGAT TGGGATTATG AACATTATGC TGGTGGCGGT CACCGAGCGG
ACCCGTGAGA TCGGGGTGCG CAAAGCGTTG GGGGCCAGCG ATGGCGATGT GTTGGGCCAA
TTTGTGATGG AGGCGGTGGC CCTGAGCCTG GTTGGTAGTC TGATCGGCGT GATCGGGGCG
ATTGGTCTGG TCTGGTTGAT CAGTGCTGTT GGCGGGATCA ACACGGGCAT CTCGTGGATC
GGCATTGTGC TGGCGTTGGG GTTTGCATCG GCGATTGGGA TTGGGTTTGG GTATTACCCG
GCCCGACGGG CAGCGCTGCT GCCGCCGATT GAGGCGTTGC GGTACGAATA G
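
The gene sequence is printed in blocks of 10 bases, 60 per line, so the spacing has to be stripped before any computation. The sketch below shows one way a GC content figure such as the 58% reported above could be derived from the block-formatted text. Both helper names are hypothetical, and how the database itself rounds or computes the value is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_fraction(text: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(gc_fraction("ATGTTCTCTG AATTGATCGC TATGGCCTTT"))
```

Passing the full gene sequence to `gc_fraction` and multiplying by 100 gives a percentage that can be compared with the GC content field.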
 
Protein sequence
MFSELIAMAF DSLRANKFRS LLTMLGVIIG AGTLVAVLSL GNALQGQVFE QFVDLGTRRI 
AITPGDPRAK GARDVPGYGL LSVQDYQVLA EMATSRPDLF RAIAPEITVS TQARAGTVAI
QTLLVGTTEN YPQVQRTPML YGRFLTPEDE AEGARVGVLG WLVARDLFGS DKAALRNVIG
QTIEVNGQPI EIVGIINENG GPFSTDGRIF TPVSTMRLRL IGDLDLPGRG LRMSSILLGL
QSEQQVNEAV ALIERTLRAA RNVPDGVIND FNLQTPTQAL NVLAAISTAI TGFIAVVAGI
SLVVGGIGIM NIMLVAVTER TREIGVRKAL GASDGDVLGQ FVMEAVALSL VGSLIGVIGA
IGLVWLISAV GGINTGISWI GIVLALGFAS AIGIGFGYYP ARRAALLPPI EALRYE
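
The protein sequence is the translation of the gene sequence under translation table 11 (bacterial, archaeal and plant plastid code). That table assigns the same amino acid to each codon as the standard code and differs only in which codons may serve as start codons, so a plain codon lookup is enough for a sequence that starts with ATG, as this one does. The following is a minimal sketch rather than a reference implementation; `translate` and its parameters are hypothetical names, and unknown codons are reported as `X` by assumption.

```python
from itertools import product

# Amino acids in NCBI order for codons built from T, C, A, G
# (third base varies fastest): TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, to_stop: bool = True) -> str:
    """Translate a block-formatted DNA string, reading from the first base.

    Whitespace is ignored, a trailing partial codon is dropped, and
    codons not in the table (for example, ones containing N) become 'X'.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon reached; it is not part of the protein
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three blocks of the gene sequence shown above.
    print(translate("ATGTTCTCTG AATTGATCGC TATGGCCTTT"))
```

Translating the first 30 bases of the gene sequence yields MFSELIAMAF, the first block of the protein sequence, and the last seven codons (ending in the stop codon TAG) yield EALRYE, its final six residues.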