Gene Cagg_2971 details


Gene Information

Locus tag: Cagg_2971
Symbol:
ID: 7266502
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chloroflexus aggregans DSM 9485
Kingdom: Bacteria
Replicon accession: NC_011831
Strand:
Start bp: 3642213
End bp: 3643790
Gene Length: 1578 bp
Protein Length: 525 aa
Translation table: 11
GC content: 56%
IMG OID: 643567793
Product: integral membrane protein MviN
Protein accession: YP_002464267
Protein GI: 219849834
COG category: [R] General function prediction only
COG ID: [COG0728] Uncharacterized membrane protein, putative virulence factor
TIGRFAM ID: [TIGR01695] integral membrane protein MviN
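
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon is excluded."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
length = gene_length_bp(3642213, 3643790)
print(length, protein_length_aa(length))
```

With the values from this record, the computed lengths agree with the listed 1578 bp and 525 aa.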


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000053192
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGCAACTGA GTACCGTCCA CCGCCGTCTC CGCACGCAAC GTAACAGCTT GATTGTGATG 
GGCGGCTTTA TTTTGAGCCG GATAACCGGC TTGATCCGCG ACATTGTTGC TTCATACTAT
TTCGGTACTT CTGCCGAAAT GGCAGCTTAT GGAGCCGCAA TCAGCACCGT CGACTTACTG
TACCTCGTAA TTATCGGTGG GGCGCTTGGC AGCAGCTTTA TTCCGGTATT CATTGAACTG
TGGGAGCGTG AGCAGCCAGA ACGCGCCTGG GAGCTGGCCA GTGCTGTAGT GACGTGGGCG
TTGATTATTC TGTTCGTGGC CAGCATTATC TTGTTCGGAG TAGCACCTTG GCTTGTCCCG
CTGCTCTACG GCGGACAGGG CTTTACAAGC GCAACCCTCG ACCTCATTGT CGCTATGACC
CGCCTGTTCC TTCTCTCGCC ACTATTGCTC GGCCTCGGCG GACTGGCGAT GGCGGCACTA
AATGCCCGTG ACCGGTTTAC GATGCCTGCA CTTGCACCGA GCATTTACAA TCTAGGCATT
ACCGGCGGCG CATTATTGGC GCCATGGGTT GGCATTTGGG GAATGGCATG GGGTGTCATC
ATTGGTGCGC TCTGTTACTT GCTGATCCAG CTACCGGCAC TGTTCGAGCT AGGGATGAAA
CTGCGACCAC AGTTGGGACA CAATATCGCA GAACTCAAAA AAGTAGCGCA AGCGATGGGT
CCGCGCGTGA TCGGGCAAGC GGCGGCCCAT CTGAGCATAG TCGCAACGTT AGCATTAGCT
GCCCGTCTAC CCGACGGTGA TGCCAAACTA GCCGGATTAC GCTGGGCCTA TCAATTAATG
CTTTTACCAT ACGGCGTCTT TGCGCTTAGC TTAAGCACCG TAGCGTTTCC GCGGCTCGCC
CGACTCGTCG CCGAACAACA ATTGAGTGAA CTGATCAATG ATGTGCGTAC AACCCTGAGC
CGCATTCTCT GGCTAACGCT ACCGGCAACC GCCGCCTTAC TGACCTTAGG CCCAGCGCTA
GCGCGCGTCT TATTTGAACG GGGCGCGTTT GATACACTCT CCCTCTCGTA CACTGCTGCG
GCCCTCACCG GCTACGCCTT CGCGTTACCG GCTTTCGCTG CATCAGAAAT CATGATCCGT
ACTTTTTATG CGATGCAACG TACCTGGCCA CCGGTGCTGA TCGGTCTCGG ACAGGTGACG
CTCAACATCG GTCTTGGTAC AGTATTGCTG TTCGCAGGAG CTGACATAGG CGGGCTAGCA
ATTGCCTTCA GTATCGCCAA CACCCTCGAA ACGGTGTTAT TGGCAATTGT CCTCGCACGC
ACACTACCGG GTATCTGGGA AACACCGAGA GTCTGGCAGC ACTTCATGAG CGCCTTATCT
GCCAGCCTCT TAGTTGGCGG GCTATGGTGG TATGCACGCG ATCTGATCCC CGGCGGCACA
CCAGCAGCCA GTTATCGCTG GCCGAACGAC GTACCAGGAC TACTGATCGG CTTGACCATA
ACCGGGATCG GCGGGGCAGC CCTGTACATC GTTCTCACGC TCTTGATGAA TCATTACACC
ACGAGAACGG CAAGTTAA
 
Protein sequence
MQLSTVHRRL RTQRNSLIVM GGFILSRITG LIRDIVASYY FGTSAEMAAY GAAISTVDLL 
YLVIIGGALG SSFIPVFIEL WEREQPERAW ELASAVVTWA LIILFVASII LFGVAPWLVP
LLYGGQGFTS ATLDLIVAMT RLFLLSPLLL GLGGLAMAAL NARDRFTMPA LAPSIYNLGI
TGGALLAPWV GIWGMAWGVI IGALCYLLIQ LPALFELGMK LRPQLGHNIA ELKKVAQAMG
PRVIGQAAAH LSIVATLALA ARLPDGDAKL AGLRWAYQLM LLPYGVFALS LSTVAFPRLA
RLVAEQQLSE LINDVRTTLS RILWLTLPAT AALLTLGPAL ARVLFERGAF DTLSLSYTAA
ALTGYAFALP AFAASEIMIR TFYAMQRTWP PVLIGLGQVT LNIGLGTVLL FAGADIGGLA
IAFSIANTLE TVLLAIVLAR TLPGIWETPR VWQHFMSALS ASLLVGGLWW YARDLIPGGT
PAASYRWPND VPGLLIGLTI TGIGGAALYI VLTLLMNHYT TRTAS
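
The relationship between the gene sequence and the protein sequence under translation table 11 can be illustrated with a short sketch. It assumes the NCBI table 11 codon assignments, reads a recognized initiator codon in the first position (such as the GTG that opens this gene) as M, and stops at the first stop codon. The names `CODON_TABLE`, `START_CODONS`, and `translate_table11` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiator codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str) -> str:
    """Translate a CDS; whitespace between 10-base blocks is ignored."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate_table11("GTGCAACTGA GTACCGTCCA"))
```

Applied to the first two blocks of the gene sequence, this gives MQLSTV, which matches the start of the listed protein sequence.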