Gene AnaeK_1999 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAnaeK_1999 
Symbol 
ID6786027 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaeromyxobacter sp. K 
KingdomBacteria 
Replicon accessionNC_011145 
Strand
Start bp2250785 
End bp2252488 
Gene Length1704 bp 
Protein Length567 aa 
Translation table11 
GC content71% 
IMG OID642763457 
ProductMammalian cell entry related domain protein 
Protein accessionYP_002134356 
Protein GI197122405 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1463] ABC-type transport system involved in resistance to organic solvents, periplasmic component 
TIGRFAM ID[TIGR00996] virulence factor Mce family protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAGCCCG CCGTCAACAA GGCCCTCGCC GTGGGCGTGC TCGTCGCCGT CGGCCTGGCG 
GCGTTCCTGT TCGCGTTCAC CTTCTTCAAG AAGGGCGGGT ACTCCGAGGC GGACAGCTAC
CTCGTGTACG CGCGGTTCAG CGACGCCACC GGCCTCACCT GGAAGAGCAA GGTGCAGATC
GCCGGCATCC AGGTGGGCGA GGTCGCGAAG ATCTCGCTCG ACAAGAACAA GGCGCTGCTC
CAGATCCGCA TCGACCGCTC GGTGCCGCTC CACACCGACG CCTGCCTCTA CAAGAGCTTC
CCGTCCGCGC TGCTCCCCGA CGCGCTGCTC GAGGTCATCG CCGGCTCCGA CGCCGCGCCG
CTCCTCTCGT CGCTGCCGGA GGCCGAGCGC GAGATCAAGT GCGTGCGCGA GGCCACCAGC
GTGCAGCAGC TGCTCGACTC GATGGCGAAG ATCGCCAGCG ACGTGCAGCT CGTCACCGGC
GACCTCGCCA AGACCGTCCA GGGCGACCAG GGCAGCCTGC GCGAGATCGT GGAGAACCTG
GCCCGCATCA CGCGCCAGGT CGATCAGGTG GTGGCGCAGA ACAGCGCCAA CCTCTCCGAG
CTCATCGCGA ACACCCGCGA CTTCACCGCC GACCTGCGCG AGATCTCGGC GCGCGACAAG
GACCGCATCC ACAGCATCCT CGCGAACGTG GACGAGCTCA CCGCGCGCCT GAAGGTCGCC
GCGGGCAGCC TGCAGGGCAT CCTCGACGGC GGCGGCTCCG GCGCTCCGGG CGGCGGTCCG
CCCGGCGCTC CCGGCGCTCC CGGCGCCCCG GGAGCACCTG GCGCACCCGG CGCGCCCGGC
ACCGCGGGCG CGACGCCGGC GGTCGCCAGC CAGCAGGCGC AGGCGAAGGG CGTGCAGCAG
GCGGTGGCGC GCCTCAACGA CAGCCTCTCC CGGCTCGACC AGCTCCTCGC CAAGGTCCAG
GAGGGGAAGA GCGTCGCCGG CCGGCTCCTC ACCGACGAGA AGATGGGCCG CCAGCTCGGG
ACCGCGGTGG AGGGCGTCTC GGACTACGTG GACCGGCTGC AGAAGATGCA GATCGAGGTC
CAGCTCCGCT CCGAGTGGCT GCTCAACCAG AGCGTGGAGG ACGGCCGCCC CGGCGCGAAG
GTCTACTTCG GCGCGAAGCT GCTGCCGCGC CCGGACAAGT ACTACCTGCT CGAGGTGGTG
AGCGATCCGC GCGGCGTGGA CACGGTCACG ACCGACACCA TCACCACCCG CACGCCGGGC
TCGGTCGGCG ACTCGACCAC GGTCACCACC CGGACCCGGC ACGAGGACAA GGTCACGTTC
TCGCTGCAGA TGGCGAAGCG CTACGGCCCG GTCACGTTCC GCGGCGGCGT CATCGAGAGC
TCCGGCGGCC TCGGCGCCGA CCTGCACCTC ATGAAGGACC GGCTCCAGAT CTCCACGTCG
CTCTACCAGT TCTCGCGGCC GTACCAGGAC GTGTTCCCGC GCGCCAAGGT CTGGGCGAAC
TACAACTTCC TGCAGCACTT CTACGTCACC ACCGGCGTCG ACGACTTCCT GAACCGGTGG
CGCAGCGCCG CCTCGCCCGA CGGCCGCAGC TTCAACATCG GCACCGACGT GTTCTTCGGC
GCGGGCCTCT ACTTCACCGA CGACGACCTG AAGACGCTGC TCGTCTCGGG CGCCGGCAGC
GCCGCGAGCG GCGCCGGCAA GTAG
 
Protein sequence
MKPAVNKALA VGVLVAVGLA AFLFAFTFFK KGGYSEADSY LVYARFSDAT GLTWKSKVQI 
AGIQVGEVAK ISLDKNKALL QIRIDRSVPL HTDACLYKSF PSALLPDALL EVIAGSDAAP
LLSSLPEAER EIKCVREATS VQQLLDSMAK IASDVQLVTG DLAKTVQGDQ GSLREIVENL
ARITRQVDQV VAQNSANLSE LIANTRDFTA DLREISARDK DRIHSILANV DELTARLKVA
AGSLQGILDG GGSGAPGGGP PGAPGAPGAP GAPGAPGAPG TAGATPAVAS QQAQAKGVQQ
AVARLNDSLS RLDQLLAKVQ EGKSVAGRLL TDEKMGRQLG TAVEGVSDYV DRLQKMQIEV
QLRSEWLLNQ SVEDGRPGAK VYFGAKLLPR PDKYYLLEVV SDPRGVDTVT TDTITTRTPG
SVGDSTTVTT RTRHEDKVTF SLQMAKRYGP VTFRGGVIES SGGLGADLHL MKDRLQISTS
LYQFSRPYQD VFPRAKVWAN YNFLQHFYVT TGVDDFLNRW RSAASPDGRS FNIGTDVFFG
AGLYFTDDDL KTLLVSGAGS AASGAGK