Gene EcSMS35_4734 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4734 
SymbolargF 
ID6144756 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4832383 
End bp4833387 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content52% 
IMG OID641619549 
Productornithine carbamoyltransferase subunit I 
Protein accessionYP_001746657 
Protein GI170683240 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0078] Ornithine carbamoyltransferase 
TIGRFAM ID[TIGR00658] ornithine carbamoyltransferase 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.488433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.892607 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGGGT TTTATCATAA GCATTTCCTG AAATTACTCG ATTTCACGCC AGCTGAACTC 
AACAGCCTGC TGCAGTTAGC CGCGAAGCTG AAAGCCGATA AGAAAAGCGG TAAAGAAGAA
GCCAAACTCA CTGGTAAAAA CATCGCGCTC ATCTTCGAAA AAGACTCGAC CCGTACCCGA
TGCTCTTTCG AAGTTGCCGC ATATGACCAG GGTGCTCGTG TTACTTATCT CGGCCCAAGC
GGCAGCCAAA TTGGTCATAA AGAGTCGATT AAAGACACTG CCCGCGTGCT TGGTCGCATG
TATGACGGTA TTCAGTATCG CGGCTATGGT CAGGAGATTG TCGAAACACT GGCGGAATAC
GCTGGCGTGC CGGTATGGAA CGGCCTGACC AATGAGTTTC ACCCCACGCA GCTGCTGGCG
GATCTCCTCA CCATGCAGGA GCATTTGCCC GGCAAAACGT TCAACGAAAT GACGCTGGTC
TATGCAGGTG ACGCACGTAA CAACATGGGC AATTCGATGC TCGAAGCTGC AGCGCTTACC
GGTCTGGATT TGCGTCTGGT CGCGCCACAG GCATGCTGGC CGGAAGCTGC GCTGGTTACG
GAATGCCGCG CCCTGGCACA ACAAAATGGC GGGAATATTA CGCTGACGGA AGATGTAGCT
AAAGGAGTTG AAGGTGCTGA CTTTATCTAT ACCGATGTAT GGGTGTCGAT GGGTGAAGCA
AAAGAGAAAT GGGCTGAACG CATTGCATTG CTGCGTGATT ATCAAGTGAA CAGCAAGATG
ATGCAGTTGA CCGGTAACCC GGAGGTCAAA TTCCTCCACT GCCTGCCCGC GTTTCATGAC
GACCAAACGA CGCTTGGCAA AAAAATGGCG GAAGAGTTTG GCCTACATGG CGGAATGGAA
GTGACTGATG AGGTCTTCGA ATCTGCCGCC AGCATTGTAT TTGATCAGGC GGAAAACCGC
ATGCATACCA TCAAAGCGGT GATGGTCGCG ACGCTCAGTA AATAA
 
Protein sequence
MSGFYHKHFL KLLDFTPAEL NSLLQLAAKL KADKKSGKEE AKLTGKNIAL IFEKDSTRTR 
CSFEVAAYDQ GARVTYLGPS GSQIGHKESI KDTARVLGRM YDGIQYRGYG QEIVETLAEY
AGVPVWNGLT NEFHPTQLLA DLLTMQEHLP GKTFNEMTLV YAGDARNNMG NSMLEAAALT
GLDLRLVAPQ ACWPEAALVT ECRALAQQNG GNITLTEDVA KGVEGADFIY TDVWVSMGEA
KEKWAERIAL LRDYQVNSKM MQLTGNPEVK FLHCLPAFHD DQTTLGKKMA EEFGLHGGME
VTDEVFESAA SIVFDQAENR MHTIKAVMVA TLSK