Gene Noca_1074 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_1074 
Symbol 
ID4599554 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp1133007 
End bp1134242 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content75% 
IMG OID639775672 
Productexodeoxyribonuclease VII large subunit 
Protein accessionYP_922279 
Protein GI119715314 
COG category[L] Replication, recombination and repair 
COG ID[COG1570] Exonuclease VII, large subunit 
TIGRFAM ID[TIGR00237] exodeoxyribonuclease VII, large subunit 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCCTGG AGAGCTCGCT CGAGTCGCCG GCCCCGGTCC GGCAGATCGC GAACGCGATC 
GCCGGCTGGG TCGACCGGCT CGGCGCGGTG TGGGTCGAGG GCCAGGTGGC GCAGGTCAGC
CGGCGACCCG GCCTGAACAC GGTGTTCCTC ACCCTGCGCG ACGCCGTCGC CGACATCTCG
GTGCCGGTGA CCTGCTCGCG CACGCTGTTC GACGGCCTGA ACCCGCCCCT GGTCGAGGGC
GCCAGCGTGG TCCTGCACGC GAAGCCGTCG TACTACGCCA ACCGCGGCAC GCTCTCGCTG
TACGCCCGCG AGATCCGCAT GGTCGGCCTG GGCGAGCTGC TCGCCCGGCT CGAGCGACGG
CGCCAGCTGC TGGCTGCCGA GGGACTCTTC GCCGCCGAGC TCAAGCGACC GCTGCCGTTC
CTGCCCGGCA CCGTCGGCCT GGTCACCGCC CCCAACAGCG CCGCCGAGCG CGACGTGCTC
GAGAACGCCC GCCGCCGGTG GCCCGCCGTC GCCTTCGAGA TCGCGTACGC CGCGATGCAG
GGGCCGCGGT CGGCCAGCGA GGTGATCGAG GCGGTCGAGC GCCTCGACCG TGATCCGGCG
GTCGAGGTGA TCGTGGTCGC CCGGGGCGGC GGCTCGGTCG AGGACCTGCT GCCGTTCTCC
GACGAGGCGC TGATCCGCGC GGTGCACCGG ATCCGCACGC CGCTGGTCTC CGCGATCGGC
CACGAGCCCG ACTCCCCGCT GCTCGACCTG GTCGCCGATG TCCGCGCCTC GACGCCCACG
GATGCCGCCA AGCTCGTGGT CCCGGACGTG GCCGAGGAGC AGCGCAACGT GCGCCGTGCC
CGGGAGCGGG CGCGCGGCGC ACTGGCCAGG TGGATCGCCC GGGAACAGGC GGGCCTCGAC
GGGCTCCGCT CCCGGCCGGC GCTGGCCGAC CCGCGGATGC TCCTCGACGC CCGGCGCGAC
GAGGTCGACC AGCTGCGCGA CCGGGCCCGG CGCTGCCTGG GCCACGCGCT GGACCGGGCG
GCCGACGACA TCGGCCACCA CCGGGCCCGC GCCCGGGCGC TGTCCCCGCT GGCCACCCTG
CAACGCGGGT ACGCCGTGCT CCAGGACGCC GACGGCCACG TGGTCACCTC GGTCGGTGCC
GTCGCGCCGA AGCAGCAGGT CAGCGTGCGC GTCGCCGACG GCCGGATCCA CGCCACCACC
ACCAGCACCG AGGAGCTCGA TGTCCAAGAA GGCTGA
 
Protein sequence
MALESSLESP APVRQIANAI AGWVDRLGAV WVEGQVAQVS RRPGLNTVFL TLRDAVADIS 
VPVTCSRTLF DGLNPPLVEG ASVVLHAKPS YYANRGTLSL YAREIRMVGL GELLARLERR
RQLLAAEGLF AAELKRPLPF LPGTVGLVTA PNSAAERDVL ENARRRWPAV AFEIAYAAMQ
GPRSASEVIE AVERLDRDPA VEVIVVARGG GSVEDLLPFS DEALIRAVHR IRTPLVSAIG
HEPDSPLLDL VADVRASTPT DAAKLVVPDV AEEQRNVRRA RERARGALAR WIAREQAGLD
GLRSRPALAD PRMLLDARRD EVDQLRDRAR RCLGHALDRA ADDIGHHRAR ARALSPLATL
QRGYAVLQDA DGHVVTSVGA VAPKQQVSVR VADGRIHATT TSTEELDVQE G