Gene Information |
Locus tag | Saro_2030 |
Symbol | |
ID | 3917677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Novosphingobium aromaticivorans DSM 12444 |
Kingdom | Bacteria |
Replicon accession | NC_007794 |
Strand | - |
Start bp | 2163622 |
End bp | 2165904 |
Gene Length | 2283 bp |
Protein Length | 760 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 640444782 |
Product | ComEC/Rec2-related protein |
Protein accession | YP_497303 |
Protein GI | 87200046 |
COG category | [R] General function prediction only |
COG ID | [COG0658] Predicted membrane metal-binding protein |
TIGRFAM ID | [TIGR00360] ComEC/Rec2-related protein; [TIGR00361] DNA internalization-related competence protein ComEC/Rec2 |
|
|
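The coordinate and length fields above are consistent with one another, and the relationship can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS is complete with a terminal stop codon (the gene sequence in this record ends in TAG). The function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene_length_bp, protein_length_aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a complete CDS whose
    final codon is a stop codon.
    """
    gene_length = end_bp - start_bp + 1  # inclusive range
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the Start bp and End bp rows of this record
print(cds_lengths(2163622, 2165904))  # (2283, 760)
```

The result matches the Gene Length (2283 bp) and Protein Length (760 aa) rows.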
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTGTCTC CGGCAGAAGT TGGCCCAGCA GCAGGATTTC CTGCCGCCGA TCCGGATGCT CTGCCAGCGC GACGCGCGCG GCATTGGGAC ATCCGCGCGC GATTGTCCAG CATGCTTGAT GATGGAGAGG CGTTCCTTGC CGCCCACCCG TTTGAACGCG GCCTCTGGCT GGTGGTTTCC TTCGGCGCGG GGATCGTATG CTGGCTCGCG CTTCCCCTGT CGGTCCAGTG GACGGCGACG CTGCTGGCCT GCGGTTCTGC CGCGCTGATC TCCCTGCTGT ACGTCGACGA ACACCGCCTT CCATACATCC GGCTCGGCAT CGCGGGCGTG GCGATCATGC TGGCGGCCGG CATGGCAACG GTGTGGACCA AATCCGCCCT GGTGGGCCAG CCCGGCATCG GTCGGCCGAT GGTGACCACT GTGGTTGGAA CGGTCCTCGA CCGGCGCGAG GAACCGGCCA GGGAACGCTC GCGCCTTCTG CTGCTGACGC GCCTGCCCGA GTTTCCCGAT CCTGTTCGCG TCAGGGTGAA CCTTCCGAAG GACGAAGACC GGCCCTTCAT CGTGGAGGGC GCTACGCTTG CCTTGCGCGC GCGATTGATG CCGCCCGCGC CGCCCATGCT GCCCGGTGCC TATGATTTCG CCCGCAAGGC CTGGTTCGAT GGAATCGCGG CCACCGGGAC GGTCATGGAC GATGTCCGGC TTGTTTCGCC GTCCGGCAAC GCCACCACCT TGCGGAAAAT CCAGAGGGCC CTGGCGGATC ACGTCCGCTC TCGTCTCGCC GGTTCCGCAG GGGCGATAGC CGCGGCCTTT GCCAGCGGAG ACAGGGGCGC AATCGCCAAG GCGGACGAGG ATGCGATGCG CGATGCCGGG CTTACCCATC TGCTGTCGAT CAGCGGCCTC CATGTCAGTG CGCTGGTGGG GGCCGTTTAC TGGATTTTCG CGCGATCGCT TGCGCTGGTC CCCTGGGTCG CGCTGCGCAT CCGCGTACCG ATCGCCGCCG CGCTGGCGGG TGCGCTTGCC GGCATTGCCT ACACCCTCAT CACCGGGGCC GAAGTGCCCA CCATCCGTTC GTGCATAGGG GCCCTGCTCG TGCTCGCGGC ACTTGCGCTC GGGCGTGATC CGCTCTCGCT GCGCATGGTC GCGGTCGGTG CCCTTGTCGT CATGCTGTTC TGGCCGGAAG CCGTGTTCGG GCCCAGTTTC CAGATGAGCT TTGCTGCGGT GATCGCCATC GTCGCCTTCC ATTCCGCCGC GCCCGTCCGC GGATTTCTCG CCGGCGAACG ACATGGGGGC CTGGCGCGGC TTGCCCGCAA CGTCCTGCTT CTGCTGGCAA CCGGCCTGGT CATCGAACTC GCGCTCATGC CCATCGGCCT GTTCCACTTC CACCGGGCGG GCGTCTACGG ATCGCTGGCC AATGTCATTG CCATCCCGCT CACCACATTC GTGACCATGC CGCTGATCGG CATCGCGCTG CTTCTCGACC TTGCCGGGCT GGGTGGACCC GCGTGGTGGC TCGTCGAAAC GTCGCTGGAT CTTCTGCTGG CACTCGCGCA CTTCGTCGCC GACCGGCCGG GGGCGGTCAC CATGCTGCCG CCGGGGGATC GGTGGAGCTA CCTCGTCTTT GTCGCCGGAA TGCTGTGGCT GGCGCTCTGG ACCGGCCGCG TCAGGCTGTG GGGCCTCCTG CCCGCATTCG CCGCAGCGCT TTCCATGGCG CTGACGCCCA CGCCCGACAT CCTCGTCACC GGGGACGGCC GGCATGTCGC CATCGCGGGC GAGGGGGCGG AACTCCTAGT CCTGAGAGCG 
GGGCGGGGAG ACTTCATCCG CGAAAATCTT CTGGAACTGG CCGGAATGGA GGGGGAGACG CGCCCCCTCG ACGAGTGGCG TGGTGCGCGC TGCGGTGAAG ATTTCTGTGC CGCGTCGCTC ATGCGGGGCG GACGCCACTT TGCTATTCTC ATGGCCCGCA GCCGCAACGA CGTCGACGAG ATGGACATTG CCGCCGCCTG CGAGCGAAGC GACATCGTGA TCGCCGACCG TCGCCTGCCG CACACCTGCC GCCCGAAGCT GCTCAAGGCA GACCGCGCGT TCCTCGCAAT GACCGGCGGC CTCTCCATCG ACCTAAGCCG CCGCCGGACA AGGACAGTCG CCGAAACGCA AGGTCGGCAA GGGTGGTACC GATGGTCAGA ACCATCCGCC ACTTTGTCGC CCCAACCGAG GGGTACTCAG GCCGAACGCC GGGACATGGC TCCGGAGACG AGGCATCGGT CCATGCCAAC GACAGCGCGC TAG
|
Protein sequence | MVSPAEVGPA AGFPAADPDA LPARRARHWD IRARLSSMLD DGEAFLAAHP FERGLWLVVS FGAGIVCWLA LPLSVQWTAT LLACGSAALI SLLYVDEHRL PYIRLGIAGV AIMLAAGMAT VWTKSALVGQ PGIGRPMVTT VVGTVLDRRE EPARERSRLL LLTRLPEFPD PVRVRVNLPK DEDRPFIVEG ATLALRARLM PPAPPMLPGA YDFARKAWFD GIAATGTVMD DVRLVSPSGN ATTLRKIQRA LADHVRSRLA GSAGAIAAAF ASGDRGAIAK ADEDAMRDAG LTHLLSISGL HVSALVGAVY WIFARSLALV PWVALRIRVP IAAALAGALA GIAYTLITGA EVPTIRSCIG ALLVLAALAL GRDPLSLRMV AVGALVVMLF WPEAVFGPSF QMSFAAVIAI VAFHSAAPVR GFLAGERHGG LARLARNVLL LLATGLVIEL ALMPIGLFHF HRAGVYGSLA NVIAIPLTTF VTMPLIGIAL LLDLAGLGGP AWWLVETSLD LLLALAHFVA DRPGAVTMLP PGDRWSYLVF VAGMLWLALW TGRVRLWGLL PAFAAALSMA LTPTPDILVT GDGRHVAIAG EGAELLVLRA GRGDFIRENL LELAGMEGET RPLDEWRGAR CGEDFCAASL MRGGRHFAIL MARSRNDVDE MDIAAACERS DIVIADRRLP HTCRPKLLKA DRAFLAMTGG LSIDLSRRRT RTVAETQGRQ GWYRWSEPSA TLSPQPRGTQ AERRDMAPET RHRSMPTTAR
|
| |
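The gene sequence is displayed in space-separated blocks of ten bases, so it has to be cleaned before it can be compared with the GC content and protein sequence rows. The following is a minimal sketch, assuming the displayed sequence is already in coding orientation (it begins with ATG even though the gene lies on the minus strand). The helper names `clean`, `gc_content` and `translate` are hypothetical. The amino acid string is the one NCBI lists for translation table 11, with codons ordered T, C, A, G.

```python
BASES = "TCAG"
# Amino acids for NCBI translation table 11, codons ordered TTT, TTC, TTA, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-base blocks and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Alternative start codons (e.g. GTG, TTG) are not recoded to M here;
    this gene starts with ATG, so that case does not arise.
    """
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE.get(s[pos:pos + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from this record
print(translate("ATGGTGTCTC CGGCAGAAGT TGGCCCAGCA"))  # MVSPAEVGPA
```

Passing the full gene sequence to `translate` and `gc_content` gives values that can be compared with the Protein sequence and GC content rows.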