Gene Ent638_3101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_3101 
Symbol 
ID5112641 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp3375316 
End bp3376503 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content47% 
IMG OID640493300 
Productphage integrase family protein 
Protein accessionYP_001177816 
Protein GI146312742 
COG category[L] Replication, recombination and repair 
COG ID[COG0582] Integrase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.241472 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCGGA TCACACGCCC CCTAACTAAC AACGAAATTC TTAAAGCTAA ACCTCGCGAA 
AAAGACTTTA CCCTCCATGA TGGGGACGGC TTGTTCTTAC TCGTCAAAAC CTCTGGTAAA
AAACTCTGGC GTTTTCGCTA CCAGCGACCA AACAGCACCA GCCGTACAAA TCTCAGCCTT
GGCGCATATC CTGCCCTTAC GCTTGCAGCA GCCCGTCTGA TACGCGATCA GCATTTGTCT
CTCTTAGCAC AGGACATAGA TCCTCAGCAG CAACAAGAAA TAGTCTCAGA ACAGCGCCAA
ATAAAGCTGG ACAGCGTTTT CTCTACAGTT GCCGCCAATT GGTTCCAGCT AAAGAGCAAA
AGCGTAACAC CGGATTATGC AAAAGACATT TGGCGCTCAT TAGATAAAGA CGTGTTCCCT
GCTATTGGCG AGATACCAGT TCAAGAGATC AAAGCCAGAA CTATTATTGA AGCGCTTGAG
CCTATCAAAG CGCGTGGAGC ACTGGAAACA GTTCGTCGTC TTGTACAGCG TATCAATGAG
ATTATGATTT ATGCGGTAAA TACCGGCTTG CTTGATGCCA ACCCAGCGTC AGGGGTTGGA
ATGGCTTTTG AGAGACCCAA GAAGCAAAAT ATGCTTACGC TTCGACCAGA AGAATTGCCC
AAGCTGATGC GTTCAATAGG CATGTCAAAT CTGTCTGTTC CAACTCGCTG CCTAATCGAA
TTGCAGCTCC TCACCCTTGT TCGCCCTTCA GAAGCTTCTG GTGCTCGATG GGCAGAGATT
GATATCGATG CAAAGCTTTG GAAAATCCCA GCAGAACGGA TGAAAGCGAA GCGTGAACAC
ATTGTACCTT TATCTCCTCA GGCGTTAGAG ATTCTAGAGA TTATGACGCC TATCAGTGCG
CATCGCGAGT ATGTGTTTCC AAGCAGGAAT GATCCAAAGC AACCCATGAA TAGCCAGACG
GCTAATGCGG CTATAAAGCG TATTGGCTAT GGAGGCCGTC TAGTTGCACA TGGTCTTCGT
TCTATCGCAA GTACAGCGAT GAATGAGGAA GGATTTAATC CTGATGTTAT TGAAGCGGCA
TTAGCCCATA GTGATAAAAA TGAAGTTCGT CGAGCATATA ATAGATCTAC ATACCTTGAA
GCACGGAGAG AACTAATGGA TTGGTGGGGT TCAGCCATAT ACAAATAA
 
Protein sequence
MARITRPLTN NEILKAKPRE KDFTLHDGDG LFLLVKTSGK KLWRFRYQRP NSTSRTNLSL 
GAYPALTLAA ARLIRDQHLS LLAQDIDPQQ QQEIVSEQRQ IKLDSVFSTV AANWFQLKSK
SVTPDYAKDI WRSLDKDVFP AIGEIPVQEI KARTIIEALE PIKARGALET VRRLVQRINE
IMIYAVNTGL LDANPASGVG MAFERPKKQN MLTLRPEELP KLMRSIGMSN LSVPTRCLIE
LQLLTLVRPS EASGARWAEI DIDAKLWKIP AERMKAKREH IVPLSPQALE ILEIMTPISA
HREYVFPSRN DPKQPMNSQT ANAAIKRIGY GGRLVAHGLR SIASTAMNEE GFNPDVIEAA
LAHSDKNEVR RAYNRSTYLE ARRELMDWWG SAIYK