Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Mmar10_1467 |
Symbol | |
ID | 4284577 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Maricaulis maris MCS10 |
Kingdom | Bacteria |
Replicon accession | NC_008347 |
Strand | + |
Start bp | 1607888 |
End bp | 1609741 |
Gene Length | 1854 bp |
Protein Length | 617 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 638140950 |
Product | peptidyl-dipeptidase A |
Protein accession | YP_756697 |
Protein GI | 114570017 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.000834106 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0509695 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAACGTT TGATGACCGG GGTCGCCTCC GGCGCCCTTC TGGCAGTACT TGCGGGCTGT AGCGCCCCCA ATGAGCCTGC TGCCGACACG CAAGCTGCTG CACCTGCAAC CGCCTCACCC GTCACGGTGG CCGATGCCGA ACAATTCCTC GCCGATGCGA CGACGGAATT GCGTGAGTTC AGCGAGTTCG GCGCCCGCAC CGCCTGGGTC CAGAACAATT TCATCACTTA TGACACCAAC TGGCTGCTCG AGCGCATGTC GACTCAAGGC ACCGAGATGG CTGTCCGTCT GGCGACCGAG ACCGCGCGCT TCAATGAGCT GGAGATGACC ACCGAAATGT CGCGGCAGAT GAATGTGCTG CGCGCCGGGA TCACACTGCC GGCGCCGTCA GACAGTGCCG CGGCGGCACG CCTGTCCGAG CTGACGACCC GGATGGGCTC GGCCTATTCG ACCGGTCTGA TGGAAATCGA CGGCGAGATG GTCGACCATA ACGAGCTGGA AAACATCATG CGGACCAGCC GCGACCCGGA ATTCCTGTCC CATGTCTGGG CTGGCTGGCG TGACAGCTAC AATCCCGAGC AGATGTCGAC CGACTATGCC GAAATGGTCG AGATCGCCAA TGCCGGCGCC CGCGATCTGG GTTTCTCCGA TCTCGCCGAG ATGTGGCTGT CCAATTACGA CATGCCGGCC GACGAGATGG AGGCCGAAGT CGAGCGCCTC TGGGGCCAGG TCGAGCCGCT TTATGAGCAG CTCCATTGTG CCGTCCGCTC GGAGCTGAAT GGCCTCTATG GCGACGAGGT CCAGGCTGCC GAGGGTCCGA TCCGGGCCGA TCTGCTGGGT AATATGTGGG CCCAGTCCTG GGCGGCCCTG GCCGATGTGG CCTCGGTCAG CGATGCCGGC CCTGCCTATG ACCTGACCCA GCTGCTGGTT GATGCCGATT ACGACCAGAT CCGCATGGTC GAAACCGCCG AGACCTTCTT CACCTCCCTG GGCATGGAAG AGCTGCCAGA CACCTTCTGG GAGCGTTCCC TCATCACCCA GCCGCGTGAC CGCCAGGTCG CCTGCCACGC CTCGGCCTGG AATCTCGACA GCGTCGATGA CCTGCGCATC AAGATGTGTA CCCGCGTCAA TGCCGATGAC TTCGTCACCG TGCACCACGA GCTGGGCCAC AACTTCTACC AGCGCGCTTA CAACCAGCAG GACTTCCTGT TCCAGGGCGG AGCGCATGAC GGTTTCCACG AAGCCATCGG CGACTTTATC GCCCTGTCGG TGACGCCAGA TTATCTGGTC CAGATCGGCT TGCTCGACCA GGCGGACGTG CCGGATGCTT CGGCCGACCT TGGCCTGCTG ATGGATACGG CGCTCGACAA GATCGCCTTC CTGCCCTTCG CCGTGATGAT GGATCAGTGG CGCTGGCGGG TGCTGCGTGG CGAAATCCAG CCGGACAGTT ACAATGACGC CTGGTGGGAG TTGCGCGAAA GCTATCAGGG CATCGTGCCG CCGGTCGAAC GTGGCGAGAC GGCCTTCGAT CCGGGCTCGA AATACCACAT CGCCAACAAT GTGCCCTATC TGCGCTACTT CCTGAGCTTC ATCATGCAGT TCCAGTTCCA CGAGGCCGCT TGCGAGATGG CTGGCTGGGA AGGTCCGCTG CACCGTTGCT CGATCTACGG CAATGAGGAA GTTGGCGCCC GCTTCAGCGC GATGATGGAA ATGGGGGCCT CACAGCCCTG GCCGGACGCG CTGGAAGCCT TTACCGGCAC CCGCGAAATG GATGGATCGG CCATCATCGC CTATTTCCAG CCGCTGATGA CGCATCTGGA AGAACAGAAC GCCACCCGCG ATTGCGGCTG GTAG
|
Protein sequence | MKRLMTGVAS GALLAVLAGC SAPNEPAADT QAAAPATASP VTVADAEQFL ADATTELREF SEFGARTAWV QNNFITYDTN WLLERMSTQG TEMAVRLATE TARFNELEMT TEMSRQMNVL RAGITLPAPS DSAAAARLSE LTTRMGSAYS TGLMEIDGEM VDHNELENIM RTSRDPEFLS HVWAGWRDSY NPEQMSTDYA EMVEIANAGA RDLGFSDLAE MWLSNYDMPA DEMEAEVERL WGQVEPLYEQ LHCAVRSELN GLYGDEVQAA EGPIRADLLG NMWAQSWAAL ADVASVSDAG PAYDLTQLLV DADYDQIRMV ETAETFFTSL GMEELPDTFW ERSLITQPRD RQVACHASAW NLDSVDDLRI KMCTRVNADD FVTVHHELGH NFYQRAYNQQ DFLFQGGAHD GFHEAIGDFI ALSVTPDYLV QIGLLDQADV PDASADLGLL MDTALDKIAF LPFAVMMDQW RWRVLRGEIQ PDSYNDAWWE LRESYQGIVP PVERGETAFD PGSKYHIANN VPYLRYFLSF IMQFQFHEAA CEMAGWEGPL HRCSIYGNEE VGARFSAMME MGASQPWPDA LEAFTGTREM DGSAIIAYFQ PLMTHLEEQN ATRDCGW
|
| |