Gene Daro_3959 details


Gene Information

Locus tag: Daro_3959
Symbol:
ID: 3567458
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 4255760
End bp: 4256920
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 56%
IMG OID: 637682432
Product: integrase catalytic subunit
Protein accession: YP_287156
Protein GI: 71909569
COG category: [L] Replication, recombination and repair
COG ID: [COG2826] Transposase and inactivated derivatives, IS30 family
TIGRFAM ID:
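
The start, end, and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above; variable names are illustrative.
start_bp = 4255760
end_bp = 4256920

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1161 386
```

Under these assumptions the result agrees with the listed gene length (1161 bp) and protein length (386 aa).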


Plasmid Coverage information

Num covering plasmid clones: 60
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGCAGA GACCCCGAAT CTATTACACG GATAGCCAGA AGGCGCTGAT GTGGGAACGC 
TGGCGCAAGG GCGATTCGCT TCAAAAAATC GCCCAGTTAT TTGATCGAAA CCACTCATCG
GTTCAGGGAG TCCTGGCCGA GACAGGTGGC ATTCCACCGG CACCCCGATG TCGCTCGAAA
CGTTCGCTCA CGCTTGCTGA ACGGGAGGAG ATTTCCCGAG GCTTGGTGGC GGGGCTTTCA
ATCCGTGTGC TTGCTGCGCA ATTGGGACGA GCACCGTCCA CCGTCAGTCG TGAAGTCAAA
CGTAATGGTG GCCGTGAATG CTACCGAGCA ACTCAGGCCG ACCAGGCAAC CTGGACTCGG
GCGCTTCGCC CCAAGCCTTG CAAACTGACT GAGAACCCTG GGCTGGCGCA TATAGTGGCG
AACAAGCTGC AGTCGCTGTG GTCACCAGAA CAGATTGCTG GCTGGCTCAA GCGGACCAAC
CCAGACAATG CGAGCAATCA GGTGTCACAC GAGACAATCT ATCGCACCCT CTATATTCAA
ACCCGAGGCG CTCTGAAGAA AGAGTTGTTG GCGTATTTGC GGCGGACGCG AGCCATGCGC
CGCTCTCGCC ATCACACACA AAAGACTGCC GATCACGGCC GAATCGTCGA TGCTGTGTCA
ATCAGCGAAC GGCCGGCCAC GGCTGATGAT CGAGCGGTGC CGGGGCACTG GGAAGGGGAT
CTGCTGTGCG GCAGCAAGAA CAGTCAGATT GCAACGCTCG TCGAACGTCA GTCGCGCTAC
CTGATGCTGG TCAAGCTCTC CGGCAAAGAC ACCGGGACCG TCACCAACGC CCTGATCAAA
AACGCTCGTA AGTTGCCGCA AGACCTTTAC AAATCGCTCA CCTGGGACCG GGGCAAGGAA
ATGGCTGGCC ATAAGCGATT CACGCTGGCG ACGGATATTC AAGTCTATTT CTGCGATCCC
CATCACCCTT GGCAACGTGG AACGAATGAG AATACAAATG GCCTGTTGCG CCAGTATTTC
CCAAAGGGAA TTGACCTGTC CCCCTATTCG CAAGCGAAGC TGAGTGCCAT TGCCCGAAAA
CTGAATGAGC GCCCACGGAA AACACTAAAC TACGAAACAC CGGCACAACG TTTTTACCAA
ACCGTTGCAT CCACCGGTTG A
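
The GC content field of this record is a percentage of G and C bases in the gene sequence. As a rough sketch of how such a value might be computed, the hypothetical helper below counts G and C after stripping the spaces and line breaks used in the listing. How the database rounds the value is not stated in the record, so no rounding is applied here.

```python
def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First ten bases of the gene sequence above.
print(gc_fraction("ATGAAGCAGA"))  # 0.4
```

Passing the full 1161 bp sequence to the same function gives the fraction for the whole gene.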
 
Protein sequence
MKQRPRIYYT DSQKALMWER WRKGDSLQKI AQLFDRNHSS VQGVLAETGG IPPAPRCRSK 
RSLTLAEREE ISRGLVAGLS IRVLAAQLGR APSTVSREVK RNGGRECYRA TQADQATWTR
ALRPKPCKLT ENPGLAHIVA NKLQSLWSPE QIAGWLKRTN PDNASNQVSH ETIYRTLYIQ
TRGALKKELL AYLRRTRAMR RSRHHTQKTA DHGRIVDAVS ISERPATADD RAVPGHWEGD
LLCGSKNSQI ATLVERQSRY LMLVKLSGKD TGTVTNALIK NARKLPQDLY KSLTWDRGKE
MAGHKRFTLA TDIQVYFCDP HHPWQRGTNE NTNGLLRQYF PKGIDLSPYS QAKLSAIARK
LNERPRKTLN YETPAQRFYQ TVASTG
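
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below is one way to reproduce that mapping. It assumes the codon-to-amino-acid assignments of table 11 match the standard genetic code (the two differ in their permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids listed in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 21 bases of the gene sequence above.
print(translate("ATGAAGCAGA GACCCCGAAT CTATTACACG"))  # MKQRPRIYYT
print(translate("ACCGTTGCAT CCACCGGTTG A"))           # TVASTG
```

Both fragments match the corresponding ends of the listed protein sequence; the final TGA codon is the stop and is not emitted.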