Gene ECD_04140 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECD_04140 
SymbolinsG 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli BL21(DE3) 
KingdomBacteria 
Replicon accessionCP001509 
Strand
Start bp4412360 
End bp4413688 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content54% 
IMG OID 
ProductIS4 predicted transposase 
Protein accessionACT45927 
Protein GI253980257 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCACATTG GACAGGCTCT TGATCTGGTA TCCCGTTACG ATTCTCTGCG TAACCCACTG 
ACTTCTCTGG GGGATTACCT CGACCCCGAA CTCATCTCTC GTTGCCTTGC CGAATCAGGT
ACTGTAACGC TACGCAAGCG CCGTCTTCCC CTCGAAATGA TGGTCTGGTG TATTGTTGGC
ATGGCGCTTG AGCGTAAAGA ACCTCTTCAC CAGATTGTGA ATCGCCTGGA CATCATGCTG
CCGGGCAATC GCCCCTTCGT TGCCCCCAGT GCCGTTATTC AGGCCCGCCA GCGCCTGGGA
AGTGAGGCTG TCCGCCGCGT GTTCACGAAA ACAGCGCAGC TCTGGCATAA CGCCACGCCG
CATCCGCACT GGTGCGGCCT GACCCTGCTG GCCATCGATG GTGTGTTCTG GCGCACACCG
GATACACCAG AGAACGATGC AGCCTTCCCC CGCCAGACAC ATGCCGGGAA CCCGGCGCTC
TACCCGCAGG TCAAAATGGT CTGCCAGATG GAACTGACCA GCCATCTGCT GACGGCTGCA
GCCTTCGGCA CGATGAAGAA CAGCGAAAAT GAGCTTGCTG AGCAACTTAT AGAACAAACC
GGCGATAACA CTCTGACGTT AATGGATAAA GGTTATTACT CACTGGGACT GTTAAATGCC
TGGAGCCTGG CGGGAGAACA CCGCCACTGG ATGATACCTC TCAGAAAGGG AGCGCAATAT
GAAGAGATCA GAAAACTGGG TAAAGGCGAT CATCTGGTGA AGCTGAAAAC CAGCCCGCAG
GCACGAAAAA AGTGGCCGGG ACTGGGAAAT GAAGTGACTG CCCGCCTGCT GACCGTGACG
CGCAAAGGAA AAGTCTGCCA TCTGCTGACG TCGATGACGG ACGCCATGCG CTTCCCCGGA
GGAGAAATGG GGGATCTGTA CAGTCATCGC TGGGAAATCG AACTGGGATA CAGGGAGATA
AAACAGACGA TGCAACGGAG CAGGCTGACG CTGAGAAGTA AAAAGCCGGA GCTTGTGGAG
CAAGAGCTGT GGGGTGTCTT ACTGGCTTAT AATCTGGTGA GATATCAGAT GATTAAAATG
GCGGAACATC TGAAAGGTTA CTGGCCGAAT CAACTGAGTT TCTCAGAATC ATGCGGAATG
GTGATGAGAA TGCTGATGAC ATTGCAGGGC GCTTCACCGG GACGTATACC GGAGCTGATG
CGCGATCTTG CAAGTATGGG ACAACTTGTG AAATTACCGA CAAGAAGGGA AAGGGCCTTC
CCGAGAGTGG TAAAGGAGAG GCCCTGGAAA TACCCCACAG CCCCGAAAAA GAGCCAGTCA
GTTGCTTAA
 
Protein sequence
MHIGQALDLV SRYDSLRNPL TSLGDYLDPE LISRCLAESG TVTLRKRRLP LEMMVWCIVG 
MALERKEPLH QIVNRLDIML PGNRPFVAPS AVIQARQRLG SEAVRRVFTK TAQLWHNATP
HPHWCGLTLL AIDGVFWRTP DTPENDAAFP RQTHAGNPAL YPQVKMVCQM ELTSHLLTAA
AFGTMKNSEN ELAEQLIEQT GDNTLTLMDK GYYSLGLLNA WSLAGEHRHW MIPLRKGAQY
EEIRKLGKGD HLVKLKTSPQ ARKKWPGLGN EVTARLLTVT RKGKVCHLLT SMTDAMRFPG
GEMGDLYSHR WEIELGYREI KQTMQRSRLT LRSKKPELVE QELWGVLLAY NLVRYQMIKM
AEHLKGYWPN QLSFSESCGM VMRMLMTLQG ASPGRIPELM RDLASMGQLV KLPTRRERAF
PRVVKERPWK YPTAPKKSQS VA