Gene Ent638_4221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_4221 
Symbol 
ID5110433 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009425 
Strand
Start bp31444 
End bp32616 
Gene Length1173 bp 
Protein Length390 aa 
Translation table11 
GC content51% 
IMG OID640480838 
Productaldo/keto reductase 
Protein accessionYP_001165500 
Protein GI146284547 
COG category[C] Energy production and conversion 
COG ID[COG0667] Predicted oxidoreductases (related to aryl-alcohol dehydrogenases) 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.886889 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGC CAAAAACCCG TCCTGCACAA CGCACTGAAA GCATAAGCGC TTTCAGCCGA 
CGCAACTTTC TGTCTTCTTC TGCCTTAATG GGGGCAGGGT TGATTATGGG GAGTTTACCT
GACAGGGCGC ATGCAACGTC ATCAGAGCCG ACAGCAAAAC CGGCACAGGC CAGACAAGGT
TCACAGACAA TGCCGACGCG AAAACTTGGA TCTATGGTGG TTTCCGCACT GGGTGCCGGA
TGTATGAGTA TCAGCGCTAA CTACGGGGCG GCAGCGGATA AATCCCAGGG GATAAGAACG
ATACGCGAGG CACACGCCAG AGGCGTCACG CTATTCGATA CCGCCGAAGT TTATGGACCT
TATACCAATG AAGAACTGGT TGGCGAGGCG CTTGCTCCTG TTCGTAACCA GGTCTTTATT
GCCAGCAAAT TTGGATTTGA TATTCAACAT GGCGGGCTGA ACAGTCAGCC AAAACATATC
CGAAAAGTGC TGGAGGCCTC TCTCAGGCGT TTACGCACTG ACCGTATCGA TCTGTATTAT
CAGCATCGCG TTGATCCCGG TGTTCCCATA GAGGACGTAG CCGGGACTAT CCAGGATTTG
ATTAAAGAAG GCAAGGTTCT ACATTTTGGT CTTTCTGAAG CAAGTCCTTC TACCATCCAT
CAGGCTCATG CGATCCAGCC TGTCACCGCA GTACAGACGG AATATTCTGT CATGAACCGC
GATCCGGAAC ATAATGGTGT GCTGGATACC TGCGAGGAGC TGGGAATTGG TTTCGTCCCC
TGGGGGCCGA TAGGCATGGG GTATCTGACC GGAACGGTGA GCGTTAACAC TCATTTTGAT
CCCAAAACCG ACTTACGCTC CACTTTTGAA CGTTTTACGC CTGAAAATTT AGCGAATAAC
TGGCCCTTTG TGGAAAAGCT GAAAGCTATC GCTGACAGTA AGGGCGCGAC ACCGTCTCAG
ATCGCGCTTG CATGGCTTCT GGCCAAAAAA ACCTGGATTG TTCCTATTCC CGGGACACGA
AATATCAACC ATCTCCGTGA AAACCATGGT GCTTTAGAGA TCCAGTTAAC CACTACTGAG
TTAAGCGAAA TGGATAAAGC TATGTCCGGG CTTCGCGTCT ATGGTGGTCG CATGAATAGT
GCCCAGATGG ACCTCGTTGA GCCCAAAGCT TAA
 
Protein sequence
MSQPKTRPAQ RTESISAFSR RNFLSSSALM GAGLIMGSLP DRAHATSSEP TAKPAQARQG 
SQTMPTRKLG SMVVSALGAG CMSISANYGA AADKSQGIRT IREAHARGVT LFDTAEVYGP
YTNEELVGEA LAPVRNQVFI ASKFGFDIQH GGLNSQPKHI RKVLEASLRR LRTDRIDLYY
QHRVDPGVPI EDVAGTIQDL IKEGKVLHFG LSEASPSTIH QAHAIQPVTA VQTEYSVMNR
DPEHNGVLDT CEELGIGFVP WGPIGMGYLT GTVSVNTHFD PKTDLRSTFE RFTPENLANN
WPFVEKLKAI ADSKGATPSQ IALAWLLAKK TWIVPIPGTR NINHLRENHG ALEIQLTTTE
LSEMDKAMSG LRVYGGRMNS AQMDLVEPKA