Gene Ent638_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEnt638_1401 
Symbol 
ID5114366 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEnterobacter sp. 638 
KingdomBacteria 
Replicon accessionNC_009436 
Strand
Start bp1536099 
End bp1537076 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content48% 
IMG OID640491588 
ProductCRISPR-associated Cas1 family protein 
Protein accessionYP_001176133 
Protein GI146311059 
COG category[L] Replication, recombination and repair 
COG ID[COG1518] Uncharacterized protein predicted to be involved in DNA repair 
TIGRFAM ID[TIGR00287] CRISPR-associated endonuclease Cas1
[TIGR03637] CRISPR-associated endonuclease Cas1, YPEST subtype 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.612871 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.155418 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTGTAA ATGGAATTAC GCCTTCGGAT CTCAAAACCA TTCTCCATTC AAAACGTGCC 
AATATTTATT ATCTGGAAAA ATGTCGTGTT CAGGTCAACG GTGGGCGAGT GGAGTATGTT
ACACAGGAAG GTAAAGATTC TTTTTACTGG AACATCCCTA TTGCCAATAC TACGGCAGTC
ATGCTCGGAA TGGGTACGTC GGTCACTCAA ATGGCGATGC GGGAATTTGC TCGCGCAGGG
GTTATGGTCG GCTTTTGTGG CACGGACGGT ACACCGCTTT ATTCGGCTAA CGAAGTGGAT
ATTGATGTTT CCTGGTTTTG TCCACAAAGT GAATATCGAC CTACCAGCTA TTTACAGAAT
TGGGTGTCAT TCTGGTTTGA TGAGCAAAAA AGGCTGCAAG CCGCAAAACA ATTCCAGTAT
ATCCGGTTGC AACAAATCGA AAAATATTGG CTGGCATCAA AAAAACAGCG TGATAAATCA
TTTCATCCAG ACAGTCAAAA TCTGAAAAAT AGTCTTGAAC GCGCCAGACT GGCAATGGAA
TCTGCAAACG ATCACACCAC GCTGATGCTT CAGGAAGCAC AGCTGACCAA ATCGCTTTAT
AAACTGGTTA GCCAAACCGT TGGCTATGGC CATTTTACTC GCGCTAAACG TGGCGGTGGC
GTGGATATGG CAAACCGCTT TCTCGATCAG GGAAATTATC TGGCCTACGG GCTGGCCGCT
GTCGCCACAT GGGTAACGGG AATACCGCAT GGTCTTGCGG TGATGCACGG CAAAACCCGT
CGCGGCGGGT TGGTATTCGA TATTGCGGAT TTGATTAAGG ACGCGCTTGT CATGCCGCAG
GCATTCCTGG CTGCCATGGA AGGCGAGGAT AATCAAATGT TTCGCCAGCG ATGCATAAAT
GCTTTTCAGC AGGCTGACGC CCTGGACCTC ATGATTTCCT CCCTTCAGGA AACGGCGGAA
GGGAGTGCGC TGCGATGA
 
Protein sequence
MSVNGITPSD LKTILHSKRA NIYYLEKCRV QVNGGRVEYV TQEGKDSFYW NIPIANTTAV 
MLGMGTSVTQ MAMREFARAG VMVGFCGTDG TPLYSANEVD IDVSWFCPQS EYRPTSYLQN
WVSFWFDEQK RLQAAKQFQY IRLQQIEKYW LASKKQRDKS FHPDSQNLKN SLERARLAME
SANDHTTLML QEAQLTKSLY KLVSQTVGYG HFTRAKRGGG VDMANRFLDQ GNYLAYGLAA
VATWVTGIPH GLAVMHGKTR RGGLVFDIAD LIKDALVMPQ AFLAAMEGED NQMFRQRCIN
AFQQADALDL MISSLQETAE GSALR