Gene Veis_3849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_3849 
Symbol 
ID4695191 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp4246326 
End bp4247345 
Gene Length1020 bp 
Protein Length339 aa 
Translation table11 
GC content66% 
IMG OID639851598 
Productoligopeptide/dipeptide ABC transporter, ATPase subunit 
Protein accessionYP_998576 
Protein GI121610769 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0444] ABC-type dipeptide/oligopeptide/nickel transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0430239 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.916901 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGGCTG AACAAGCGAC GCAGGCCCTG GAGACGCGGT TGCTCCAGGT CGAGAATCTG 
CAGACGCGAT TTGCTACGCC CCATGGCCTG GTACGTGCCG TCGATGGTGT CAGCTTGCAG
CTGAACAGAG GCGAGACGCT GGGTGTGGTT GGCGAGTCGG GCTCGGGAAA ATCGATACTG
GCGCGCAGCC TGATGGGACT GCTGCCGCCG GCACCCTACA GCCAGCGCTC CGGCCGCATC
CTGCTGTCGG GACGCGACAT CGCCGCGCTG GATGAGGCTC AACTGTCCAC AGTGCGCGGG
CGTGAGATCG CGATGATCTT CCAGGACCCC ATGACTTCGC TGAACCCGGT TCTGACCATT
GGCCAGCAGA TCATCCAGGT GCTGCGTCGC CACACCGACC TGGGCGAGCG CGCTGCCCGC
GAACGGGCGG TCGAATTGCT GCAGCAGGTG AGGATTCCGG CGCCCGAGAG GCGCGTGCAC
GACCACCCGC ACCATCTTTC GGGCGGCATG CGACAGCGCG CGGTGATCGC GATCGCGCTG
GCCTGCGGGC CGCGCTTGCT GATCGCGGAT GAACCAACGA CGGCGCTGGA CGTGACCGTG
CAGGCGCAAA TCCTGCAACT GCTGCGCAGC CTGCAGGAGG TGCACCACAT GGCGCTGGTG
CTGATCAGCC ACAACCTGGG CGTGGTGGCG CAGATGTGCG ACCGCGTCGC TGTCATGTAC
GCCGGCCGCA TCGTCGAGGA AGCCCCGACT GCCGACCTGT TCGCCGCGCC CCGCATGCGC
TACACCGAGG CGCTCATGAA GTCCATGCCC CGGCTGGATG CGCCCAGCCA CGCTCTGCTG
TACGTGATTG GCGGGCGCCC GCCCAACCTT GCGCAGCCGG TTCCCGGCTG CGGTTTTGCG
CCGCGCTGCC GCGATACCGT TGCAAGCTGC GCGAGCATGT CACCAGCCGA GTCAATGGCC
GGAGAACGGC ACCGCTTCGC ATGCTGGAAT CCTTTGCAAG GAGTCAATGA TGGCCGGTAA
 
Protein sequence
MMAEQATQAL ETRLLQVENL QTRFATPHGL VRAVDGVSLQ LNRGETLGVV GESGSGKSIL 
ARSLMGLLPP APYSQRSGRI LLSGRDIAAL DEAQLSTVRG REIAMIFQDP MTSLNPVLTI
GQQIIQVLRR HTDLGERAAR ERAVELLQQV RIPAPERRVH DHPHHLSGGM RQRAVIAIAL
ACGPRLLIAD EPTTALDVTV QAQILQLLRS LQEVHHMALV LISHNLGVVA QMCDRVAVMY
AGRIVEEAPT ADLFAAPRMR YTEALMKSMP RLDAPSHALL YVIGGRPPNL AQPVPGCGFA
PRCRDTVASC ASMSPAESMA GERHRFACWN PLQGVNDGR