Gene ECD_01878 details

Gene Information

Locus tag: ECD_01878
Symbol: uvrC
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli BL21(DE3)
Kingdom: Bacteria
Replicon accession: CP001509
Strand:
Start bp: 1938015
End bp: 1939781
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 52%
IMG OID:
Product: excinuclease ABC subunit C
Protein accession: ACT43732
Protein GI: 253978062
COG category:
COG ID:
TIGRFAM ID:
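
The start and end coordinates, gene length, and protein length listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(1938015, 1939781)
print(length_bp)                  # 1767
print(protein_length(length_bp))  # 588
```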


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 5.83054e-06
Plasmid hitchhiking: No
Plasmid clonability: unclonable

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGTACGATG CTGGTGGTAC GGTTATCTAT GTCGGCAAAG CGAAAGACCT GAAAAAACGG 
CTTTCCAGCT ATTTCCGTAG CAACCTCGCT TCGCGCAAAA CCGAAGCGCT GGTCGCCCAG
ATCCAGCAAA TTGATGTAAC GGTTACTCAT ACAGAAACCG AAGCGCTGTT GCTGGAACAC
AACTACATCA AACTCTATCA GCCGCGTTAC AACGTTTTGC TACGCGATGA TAAATCTTAT
CCTTTTATCT TCCTGAGTGG CGATACCCAC CCGCGTCTGG CGATGCATCG TGGTGCGAAG
CATGCCAAAG GTGAATATTT CGGCCCGTTC CCGAATGGCT ATGCCGTACG TGAAACACTG
GCGCTACTGC AAAAGATTTT CCCCATTCGC CAGTGCGAAA ATAGTGTTTA TCGCAATCGC
TCGCGTCCGT GTCTGCAATA TCAGATAGGG CGCTGTCTGG GACCGTGCGT TGAAGGACTG
GTGAGTGAAG AAGAATACGC TCAGCAGGTC GAGTATGTGC GCCTGTTTTT GTCTGGCAAA
GATGATCAGG TGCTTACGCA ACTCATTAGT CGTATGGAAA CTGCCAGCCA GAATCTGGAG
TTTGAAGAAG CGGCACGTAT TCGCGACCAA ATTCAGGCGG TGCGACGCGT CACCGAAAAA
CAATTCGTTT CCAATACCGG CGACGACCTC GACGTTATTG GTGTGGCGTT CGATGCGGGC
ATGGCTTGTG TCCACGTATT GTTCATTCGT CAGGGCAAAG TGCTCGGCAG CCGCAGCTAT
TTCCCGAAAG TGCCTGGCGG TACGGAACTG AGCGAGGTGG TAGAAACCTT CGTAGGTCAG
TTCTATTTAC AAGGCAGCCA GATGCGCACC TTACCGGGTG AGATCCTGCT CGATTTTAAT
CTTAGCGATA AAACGCTGCT CGCCGATTCC CTTTCAGAAC TGGCGGGACG CAAGATTAAT
GTTCAAACCA AACCTCGCGG CGATCGGGCG CGTTATCTGA AACTCGCGCG CACCAATGCG
GCGACGGCCT TAACCAGCAA ACTTTCGCAG CAATCTACCG TTCACCAGCG GCTTACAGCA
CTTGCCAGTG TGTTGAAATT GCCGGAAGTG AAGCGGATGG AGTGCTTTGA CATCAGCCAT
ACCATGGGTG AACAAACCGT CGCTTCCTGT GTGGTGTTTG ATGCTAACGG CCCGCTGCGT
GCGGAGTATC GGCGCTATAA CATTACTGGC ATCACGCCGG GCGATGATTA TGCGGCGATG
AATCAGGTGC TGCGTCGGCG TTATGGTAAA GCCATCGACG ACAGTAAGAT CCCGGATGTG
ATACTTATCG ACGGCGGCAA AGGCCAGCTT GCGCAGGCGA AAAATGTCTT CGCCGAACTG
GATGTCTCAT GGGATAAAAA TCATCCGCTG CTACTTGGCG TTGCCAAAGG AGTAGATCGT
AAGGCTGGGC TGGAAACGCT GTTCTTTGAG CCGGAAGGTG AGGGATTCAG TTTGCCGCCA
GATTCTCCCG CGCTGCATGT TATCCAGCAT ATTCGCGATG AATCACACGA TCACGCGATT
GGCGGGCACC GTAAAAAACG GGCGAAGGTC AAAAATACCA GTTCCCTGGA AACCATTGAA
GGCGTCGGGC CAAAACGCCG GCAAATGTTG TTGAAATATA TGGGCGGTTT GCAAGGTTTA
CGTAACGCCA GCGTCGAGGA AATTGCAAAA GTGCCGGGTA TTTCGCAAGG TCTGGCAGAA
AAGATCTTCT GGTCGTTGAA ACATTGA
 
Protein sequence
MYDAGGTVIY VGKAKDLKKR LSSYFRSNLA SRKTEALVAQ IQQIDVTVTH TETEALLLEH 
NYIKLYQPRY NVLLRDDKSY PFIFLSGDTH PRLAMHRGAK HAKGEYFGPF PNGYAVRETL
ALLQKIFPIR QCENSVYRNR SRPCLQYQIG RCLGPCVEGL VSEEEYAQQV EYVRLFLSGK
DDQVLTQLIS RMETASQNLE FEEAARIRDQ IQAVRRVTEK QFVSNTGDDL DVIGVAFDAG
MACVHVLFIR QGKVLGSRSY FPKVPGGTEL SEVVETFVGQ FYLQGSQMRT LPGEILLDFN
LSDKTLLADS LSELAGRKIN VQTKPRGDRA RYLKLARTNA ATALTSKLSQ QSTVHQRLTA
LASVLKLPEV KRMECFDISH TMGEQTVASC VVFDANGPLR AEYRRYNITG ITPGDDYAAM
NQVLRRRYGK AIDDSKIPDV ILIDGGKGQL AQAKNVFAEL DVSWDKNHPL LLGVAKGVDR
KAGLETLFFE PEGEGFSLPP DSPALHVIQH IRDESHDHAI GGHRKKRAKV KNTSSLETIE
GVGPKRRQML LKYMGGLQGL RNASVEEIAK VPGISQGLAE KIFWSLKH
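
The sequences above are printed in blocks of ten characters, so the whitespace has to be removed before any analysis. The following is a minimal sketch, using only the Python standard library, of cleaning such a block, computing GC content, and translating a CDS. The helper names are hypothetical. The codon table holds the standard amino acid assignments, which translation table 11 shares; special handling of alternative start codons is omitted on the assumption that the CDS starts with ATG, as this one does.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First thirty bases of the gene sequence listed above.
dna = clean_sequence("ATGTACGATG CTGGTGGTAC GGTTATCTAT")
print(translate(dna))  # MYDAGGTVIY
```

Passing the full gene sequence through `clean_sequence` and `translate` yields a string that can be compared against the listed protein sequence, and `gc_content` gives a value to check against the GC content field.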