Gene Ent638_1357 details


Gene Information

Locus tag: Ent638_1357
Symbol:
ID: 5114320
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Enterobacter sp. 638
Kingdom: Bacteria
Replicon accession: NC_009436
Strand:
Start bp: 1487905
End bp: 1489557
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 62%
IMG OID: 640491544
Product: phage tail-like protein
Protein accession: YP_001176089
Protein GI: 146311015
COG category: [S] Function unknown
COG ID: [COG5283] Phage-related tail protein
TIGRFAM ID:
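
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive (the record does not state this), and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 1487905
end_bp = 1489557

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# The final codon is the stop codon, so it does not add a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1653 550
```

Under that assumption, both results agree with the Gene Length and Protein Length fields.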


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAAAAG GCGCGGACAT TATCGATGTG TTACAGCGCA TGGGGGGCGT GGCTGACCGT 
CTGGACTATC GCAAGGCCGC CGCGCTCGGC TCCACGTTCC TGTCGCTGGG CGCTGCGCCA
GAAATTGCCG CCAGCGCGTC AAACGCCATG GTGCGTGAGC TGTCGATTGC CACTATGCAG
AGCAAGCGCT TCTTTGCAGG CATGGACCTG CTGAAACTCA ATCCGGCAGA GATTGAAAAG
CAGATGACCA CGGACGCCAT CGGCACCATC CAGCGCGTGC TGGAGAAGGT CAACCGCCTG
CCGCAGGACA AACGCCTGTC CGCCATGACG ATGCTGTTTG GCAAAGAGTT TGGCGATGAC
GCGGCGAAGC TCGCTAACAA CCTGCCGGAG CTGCAGCGCC AGCTCAGACT CACGTCCGGC
GGGGATGCGA ACGGCTCCAT GCAAAAAGAG TCCGACATCA ACAAGGATTC ACTTTCCGCG
CAGTGGTTAC TGGTCAAGAC AGGGGCGCAG AACGCCTTCA GCAGCCTGGG CGAAACGCTG
CGCGAGCCGC TGCTGTCCAT CATGAACACG GTGAAAGAGG TCACCGGTAC GTTCCGGCGC
TGGGTGGAAG AAAACCCTCG CCTGGCGGGC GGCCTGCTGA AAGTGGGGGC AGCGTTTGCC
GCGCTTATGG TGGTGCTCGG GACGATCATG CTGGCGGTGG CCGCGTTGCT CGGCCCGCTG
GCACTGATGC GCCTGCAGTT TTCCATTCTA GGCATTAAAG GCGGCGGTGC GTTCGGCCTG
ATTAGCAGCG CCATCAGTGG TGCCGGGAAA AGTGTTGTGT GGCTGGGCCG CCTGATGATG
GCGAACCCTA TTCTGGCGGT GATCAGCCTG ATTGCCATGG GGGCGATTTA CATCTGGCAG
AACTGGGAGA CGCTGGGGCC GAAATTCAGG GCGCTGTGGG ATGCGGTAAG CAATGGCGTC
TCGGCAGCGT GGGCCACGGT CAGACAGACC ATCAGCCAGA AATGGACGGA AATCCTCAGT
GATATTGCCG CGCTGCCGGA GAAATTCAAA GCCATGGGGA GCGCGATTAT CGACAACGTG
CTCGACGGCA TCAATGAAAA GTGGGAGGCG CTCAAAAGCA AACTGGCGTC GGTCACGGAT
TACTTGCCTG ACTGGATGAC CGGCAACACC ACAACGACAC CGCAGGTGCA GATTGTCGGG
GCGGCCACAC CGCGTGCGAT ACCGCCGGGC GGCAGTTTTG CGGGGATGTA CGACAGCGGC
GGGGCGATTC CGCGCGGGCA GTTTGGCATC GTGGGCGAGA ATGGCCCGGA AATTGTGAAC
GGTCCGGCGA ATGTGACCAG CAGGCGACGC ACTGCAGCAC TGGCGTCCGT GGTGGCTGGC
GTGATGGGAG TGGCGTCTAC ACCTGCAGAA GCAGCACCGC TGCATCCGTT CAGCCTGCCG
GTCAGGGCAT ATCAGGCGCA GCCCGCGAAG GCCGACAGCC AGCCAGCAAT TATCCGCTAT
GAGATTAACG CGCCCATTCA TATCACCGCC CAGCCAGGGC AAAGCGCGCA GGATATTGCC
CGCGAAGTGG CGCGCCAGCT CGATGAGCGT GAGCGCCGGG CCAGGGCGAA GGCCCGCAGC
AGTTTCAGCG ATCAGGGGGG ATACGAATCA TGA
 
Protein sequence
MSKGADIIDV LQRMGGVADR LDYRKAAALG STFLSLGAAP EIAASASNAM VRELSIATMQ 
SKRFFAGMDL LKLNPAEIEK QMTTDAIGTI QRVLEKVNRL PQDKRLSAMT MLFGKEFGDD
AAKLANNLPE LQRQLRLTSG GDANGSMQKE SDINKDSLSA QWLLVKTGAQ NAFSSLGETL
REPLLSIMNT VKEVTGTFRR WVEENPRLAG GLLKVGAAFA ALMVVLGTIM LAVAALLGPL
ALMRLQFSIL GIKGGGAFGL ISSAISGAGK SVVWLGRLMM ANPILAVISL IAMGAIYIWQ
NWETLGPKFR ALWDAVSNGV SAAWATVRQT ISQKWTEILS DIAALPEKFK AMGSAIIDNV
LDGINEKWEA LKSKLASVTD YLPDWMTGNT TTTPQVQIVG AATPRAIPPG GSFAGMYDSG
GAIPRGQFGI VGENGPEIVN GPANVTSRRR TAALASVVAG VMGVASTPAE AAPLHPFSLP
VRAYQAQPAK ADSQPAIIRY EINAPIHITA QPGQSAQDIA REVARQLDER ERRARAKARS
SFSDQGGYES