Gene Veis_3972 details


Gene Information

Locus tag: Veis_3972
Symbol:
ID: 4693949
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Verminephrobacter eiseniae EF01-2
Kingdom: Bacteria
Replicon accession: NC_008786
Strand:
Start bp: 4356397
End bp: 4357812
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 59%
IMG OID: 639851721
Product: ABC transporter nitrate-binding protein
Protein accession: YP_998697
Protein GI: 121610890
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
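
The gene and protein lengths in this record follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein; the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 4356397
end_bp = 4357812

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1416 471
```

Both values agree with the Gene Length and Protein Length fields of the record.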


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.338423
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.174767
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCAAGAG AAATCGACAG CCCCCTCTTC CACACTGACG ACGCCGGGTA CGCAGGCAAC 
ATTTGCTCCT GCGGCAAACA TGCCAGCCAG ATGCAGATGG CGCATGGCGA TGGGCACCCG
GCGGCGCAGC AACATCTGCG GCGAATACGT TCGCAGGATC CCGAACCATT GAGCAACGAT
GTGGTGGAAG AATCCGTGTT GCGCGCACTG TTTCCGCAGG CAGCGCAGCG TCGCGGCTTC
CTGCGTGCGG TCGGCGCCAA TACGGCGCGC GCGGCCATTG CCAGCATGTT TCCGTTGGAT
GCGTTGCAGG CGATGGCGCA AGAAAAGAAC GGCGCCATCG AAAAGAAGGA CCTCAAAATC
GGCTTCATCG CGCTTACCTG CGCAGCACCG CTGATCATGG CCGATCCGCT CGGCTTTTAC
CGCAAGCAGG GCCTGAACGT GTCGCTCAAC AAAACCGCCG GTTGGGCGCT GATTCGCGAC
AAGATGATCA ACAAGGAATA TGACGCATCG CATTTCCTGT CGCCGATGCC GCTGGCGATG
TCGATCGGTG CGGGCAGCCA TCCGGTGCAG ATGCGCATCG CGACGATCCA GAACATCAAT
GGCCAGGCCA TCACGCTGCA CGTCAAGCAC AAGGACAAAC GCAATCCCGG GCAGTGGAAT
GGCTTCAGGT TTGCGGTGCC GTTCGAGTAT TCGATGCACA ACTTCCTGCT GCGCTATTAC
CTTGCCGAAA ACGGACTCGA TCCGGATCGT GACGTGCAGA TCCGCGTCAC ACCGCCACCG
GAAATGGTGG CCAATCTGCG CGCCGGCAAC ATCGACGGCT TCCTCGGTCC CGATCCGTTC
AATCAGCGCG CGGTGTACGA CGAAGTCGGT TTCATCCATA TCCTCTCCAA GGAAATCTGG
GACGGTCATC CATGCTGCGC GTTCGGCATG TCGGAAGATT TCGTCAAACA AAATCCGAAT
ACCTTCGCGG CCTTGTTCCG CGCCGTGCTG ACTGCCGCAG CGATGGCGCG CGATCCGGCC
AATCGTTCTC TGGTGGCCAA GGTGATTTCG CCGGCAGCCT ATTTGAATCA GCCGGAAACG
GTGGTCGAGC AGGTGCTGAC CGGCCGCTTC GCTGACGGTC TCGGCAACGT GAAAACCGTG
CCGGGCCGCG CCGACTTCGA TCCGGTGCCG TGGGATTCGA TGGCGGTGTG GATTTTGAGC
CAGTTAAAGC GCTGGGGTTA TGTCAAAGGC GAGATCGATT ACAAAGGCAT CGCCGAGCGC
GTGCTGCTGT TGACCGATGC CAAAAAATAC ATGAAGGAAC TCGGCCAGCC AGTGCCGGAC
GGCGCCTATC GCAAACACAT GATCATGGGC AAGGAATTCG ACCCGGCCAA GGCCGACGCC
TACGTCAACA GCTTTGCCAT CAAGAGGACG AGCTGA
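
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be cleaned before any computation. The following is a minimal sketch of how a GC fraction such as the one in the GC content field could be computed from that layout. The function name `gc_content` is hypothetical, and the sketch assumes the sequence contains only A, C, G, and T.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in a sequence.

    Whitespace (the block spacing and line breaks of the record) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# First line of the gene sequence, copied from the record.
first_line = "ATGCCAAGAG AAATCGACAG CCCCCTCTTC CACACTGACG ACGCCGGGTA CGCAGGCAAC"
print(f"{gc_content(first_line):.2%}")
```

Passing the full gene sequence, line breaks included, gives the value for the whole gene rather than for the first line only.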
 
Protein sequence
MPREIDSPLF HTDDAGYAGN ICSCGKHASQ MQMAHGDGHP AAQQHLRRIR SQDPEPLSND 
VVEESVLRAL FPQAAQRRGF LRAVGANTAR AAIASMFPLD ALQAMAQEKN GAIEKKDLKI
GFIALTCAAP LIMADPLGFY RKQGLNVSLN KTAGWALIRD KMINKEYDAS HFLSPMPLAM
SIGAGSHPVQ MRIATIQNIN GQAITLHVKH KDKRNPGQWN GFRFAVPFEY SMHNFLLRYY
LAENGLDPDR DVQIRVTPPP EMVANLRAGN IDGFLGPDPF NQRAVYDEVG FIHILSKEIW
DGHPCCAFGM SEDFVKQNPN TFAALFRAVL TAAAMARDPA NRSLVAKVIS PAAYLNQPET
VVEQVLTGRF ADGLGNVKTV PGRADFDPVP WDSMAVWILS QLKRWGYVKG EIDYKGIAER
VLLLTDAKKY MKELGQPVPD GAYRKHMIMG KEFDPAKADA YVNSFAIKRT S
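
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is one way to do that in plain Python. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the tables differ in which start codons are permitted), and it assumes the gene starts with ATG, as it does here. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA sequence, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()  # drop block spacing and newlines
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence from the record.
print(translate("ATGCCAAGAG AAATCGACAG"))  # prints: MPREID
```

The result matches the first six residues of the protein sequence, and translating the final codons `AAGAGGACGAGCTGA` gives `KRTS`, which matches its end.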