Gene Veis_1109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagVeis_1109 
Symbol 
ID4693263 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameVerminephrobacter eiseniae EF01-2 
KingdomBacteria 
Replicon accessionNC_008786 
Strand
Start bp1232666 
End bp1233682 
Gene Length1017 bp 
Protein Length338 aa 
Translation table11 
GC content67% 
IMG OID639848887 
Productsubstrate-binding region of ABC-type glycine betaine transport system 
Protein accessionYP_995901 
Protein GI121608094 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.141218 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACCACT TGCATCGTCA TCGCTCGCGC CGTCATTTCC TGCGCGCCTG CGCCGCCACT 
GCGGCCATCG CCCATCCCGT CATCGGCATG GCCCAGGGGC TGGCCGCGCA AACCACCATC
CACTATGGCG GCTCGGCCTG GCTGGGCCAC TATCCGGCTT ATCTGGCGCT GAAAAGCGGC
ACGCTCCCGG CGGCGTCGAT CGATCTGCGA TGGCAATCCT TCGGCACCTC CTCGGCGCGC
ATGAGCGCCG TCCTGTCCGG TGGGATCGAC ATCGCCTGCA CCGGCATCGT CTCGGCACTG
GCGCTGATGG CGCGCGGTTC CAGGCACTTT GCCATCATCG CCGTGCCGGA GGACTTCGGC
CGCGTCGAGG GCTTGTTTGT CCGCTCCGAT GTCAGCGCCA TCGAGCACCT GAGGGGCAAG
AAAATCGGCG TGACCTTCGC CTCCAGCGCG CACCTGCTGG TGCTCGACCT GCTGGCTGGC
GCGGGCCTGG GGCCTGCCGA TGTGACGGTG CTGAATGTGC CGGCCCCGGA GTTGCCTGCG
GCGATGGCGG CCGGCCAGAT CGATGCGGCG GCGGCATGGA CGCCGCAATT TCACCGGATT
CGCGCGCTGC CGGGCATCAA GCTGCTCGCC GATGACACGG CGTTCTCGCT GTTCAAGAGC
CACAAGGTCA CCCCGGGCCC GGATGTTCTG GTGGTGCGCC AGGCGTTTGC CGACAAGAAT
CCGCTGGCCG TGCGCGGCTT TCTCAAGGGC TATTTCAGTG CCATCGCGAT GCTGCGCGAC
CGCCCGCAGG AAGCCGCCCG GCAGTTGCTC GCGCTGACCG GCCTGTCGCT GGCAGACCAG
GTGGAGGCCA TCTTGGGGGC GCAGTGGTAC GGCAGCGAAC AGCAGCGGAA CCTGCTCAAG
GTGCCAGGCA CCTATGTCGA TGGACTGCAG GGTTTGGCCG ACATGCTGGT GGCGCACAAA
CAGATCGACA AGGCCCCGGT CGTTGGCCAA TGGATCGATG CCTCGCACTT GGCATGA
 
Protein sequence
MHHLHRHRSR RHFLRACAAT AAIAHPVIGM AQGLAAQTTI HYGGSAWLGH YPAYLALKSG 
TLPAASIDLR WQSFGTSSAR MSAVLSGGID IACTGIVSAL ALMARGSRHF AIIAVPEDFG
RVEGLFVRSD VSAIEHLRGK KIGVTFASSA HLLVLDLLAG AGLGPADVTV LNVPAPELPA
AMAAGQIDAA AAWTPQFHRI RALPGIKLLA DDTAFSLFKS HKVTPGPDVL VVRQAFADKN
PLAVRGFLKG YFSAIAMLRD RPQEAARQLL ALTGLSLADQ VEAILGAQWY GSEQQRNLLK
VPGTYVDGLQ GLADMLVAHK QIDKAPVVGQ WIDASHLA