Gene Athe_2519 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_2519 
Symbol 
ID7409388 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp2641160 
End bp2643037 
Gene Length1878 bp 
Protein Length625 aa 
Translation table11 
GC content46% 
IMG OID643716882 
ProductTRAG family protein 
Protein accessionYP_002574360 
Protein GI222530478 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.166499 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAAAG CACTTGGAAT TGTAGCGATA CTGGTAATTT ACTACCTGAT AATGGCGTAT 
TTGATAGGGA TGTTCAGCGG TGCGATATCA CAGGACGCAT GGGAACAGAT TATGGCAGGG
AAGAGTCCAA AGATAGTGAT ACAGACCGAC CCGGTAGCAG CGCTGGGCAG TGTATTTCAC
AGTCCAGCAG TTTTGAGAAT CTGGCTTGGT GTTGCTACTG TCCTACTGGT GGTAGCAGGA
GTATTCATAT TTGCATACAG AGACGAGTTG AAGATACATT TTGGCAAGGA AGTGCCAGCA
GATGAGAGGA ACACATATGG CTCAGCAGAC TGGATGAAGC AAAGCGAAGC AAAAAAGGTA
TTCAAGTTTG GGAGCGAGCA AAAAGGGATA CTGCTGGGGA AGGTAAAAGG ACAGAGGGTA
GTGCTGCCAG CAGACACGAA GTATAACAAG CACATAGCAA TCTTTGGAGC ACCGGGAACA
GGGAAGTCAA GGACGTTTGC GAGGCCGAAC ATTGTGCAGA TAGCGAAGAT GGGGCAGAGC
ATGATAGTAA CGGACCCCAA AGGCGAACTT TTTGAGGACA TGTCAGTATG GTTAGAGAAG
CAGGGGTACG ACGTAAAAGC TTTAAATCTT GTCAACATCG AATGTTCTGA CAGGTGGAAT
CCGTTGGACG CAGTAGCAGA TGATATGGAT GCGACGGTAT TTGCGGATGT AGTGATAAGG
AACACACAGG CAGGGATGAG AAAGGCAGGT GGCGACCCGT TCTGGGACAG GGCGGAGCTG
AACCTGCTCA AGGCGCTGGT GCTGTACATT AAGGAGACGA GACCGGCGCA GGGACAGAAT
TTGGGTGAAC TGTACAGGCT TTTAGCGACA ACGAACATGG CAGGGTTGCA GAGCATGTTT
ATGAATATAC CAAACGACAG GGCATCGAAG ATGGCGTACA ACATATTTGC ACAGGCAAGT
GAACAGGTAA GAACTGGTGT TATCATAGGA CTTGGCACAA GGTTGCAGGT ATTCCAGAAC
AGGCTTGTCC AAAAAATGAC AGAAGTGAGT GACATAGATT TAGAAGCACC GAAGCGAAGG
AAGGTAGCGT ACTTTTGTAT CATAAGCGAC ACACATAGAG CGTTCAGTTT CTTGAGCTCA
CTGTTTTTCA GTTTCCTGTT TATCAAGCTT GTGAACCTGC ACGATACAAC AACAGATCCA
GAGATAAGAG CAAGGGAAGT GTACTTTTTG TTAGATGAGT TTCCCAACAT TGGTGAGATA
CCGGATTTTC AGGAGAAGAT AGCGACGATT CGTTCCAGGA GACTGCACTG CAGTATTATC
TTTCAGAGTT TAGGGCAGCT TGAGAAGATA TACCCGCTGG ACTGGGAAAA CATAATAGGA
TGTTGTGACA CAAAGTTTTT CCTGGGGGCG AACGATTTGA AGACGGCGGA GTATGTATCG
GAACTGCTTG GCACGAAATC AATACACACA CGGTCGGTAT CAAGGCAGGG CGGACTGGAA
GGTCTGACAG AGATAGAACG AATAACACAG TCAGTTGGTA AAAGACAACT TTTGACACAG
GATGAAGTGC TGAGATTTGA GAACGAGAAA GCGATAGTGA TGGTAAGGGG ACACAAACCG
TTGATAGTAG AGAAAGTTGA CATAAGCGAA TTGAAGGAGA GTAAACAATT AACCATCCGG
GCAATAAAAG ACTATTTGAG ACCGTGGGCA GCAGAAGTTA CTGGAAACAC AACTGCAGGG
CAAGTTTGCG ACGCAGGAGC ACAATCACAG ATAAACGCTG ACCAGTCACA AAATAGATTT
ACGACACCAC CAGAGAATAA TGCAGATGGT TCAGAAGGGA ACGAACAGAA GGAAACCGGG
GAAGACGAAT TACTATAA
 
Protein sequence
MKKALGIVAI LVIYYLIMAY LIGMFSGAIS QDAWEQIMAG KSPKIVIQTD PVAALGSVFH 
SPAVLRIWLG VATVLLVVAG VFIFAYRDEL KIHFGKEVPA DERNTYGSAD WMKQSEAKKV
FKFGSEQKGI LLGKVKGQRV VLPADTKYNK HIAIFGAPGT GKSRTFARPN IVQIAKMGQS
MIVTDPKGEL FEDMSVWLEK QGYDVKALNL VNIECSDRWN PLDAVADDMD ATVFADVVIR
NTQAGMRKAG GDPFWDRAEL NLLKALVLYI KETRPAQGQN LGELYRLLAT TNMAGLQSMF
MNIPNDRASK MAYNIFAQAS EQVRTGVIIG LGTRLQVFQN RLVQKMTEVS DIDLEAPKRR
KVAYFCIISD THRAFSFLSS LFFSFLFIKL VNLHDTTTDP EIRAREVYFL LDEFPNIGEI
PDFQEKIATI RSRRLHCSII FQSLGQLEKI YPLDWENIIG CCDTKFFLGA NDLKTAEYVS
ELLGTKSIHT RSVSRQGGLE GLTEIERITQ SVGKRQLLTQ DEVLRFENEK AIVMVRGHKP
LIVEKVDISE LKESKQLTIR AIKDYLRPWA AEVTGNTTAG QVCDAGAQSQ INADQSQNRF
TTPPENNADG SEGNEQKETG EDELL