Gene Saro_2030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2030 
Symbol 
ID3917677 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2163622 
End bp2165904 
Gene Length2283 bp 
Protein Length760 aa 
Translation table11 
GC content67% 
IMG OID640444782 
ProductComEC/Rec2-related protein 
Protein accessionYP_497303 
Protein GI87200046 
COG category[R] General function prediction only 
COG ID[COG0658] Predicted membrane metal-binding protein 
TIGRFAM ID[TIGR00360] ComEC/Rec2-related protein
[TIGR00361] DNA internalization-related competence protein ComEC/Rec2 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTGTCTC CGGCAGAAGT TGGCCCAGCA GCAGGATTTC CTGCCGCCGA TCCGGATGCT 
CTGCCAGCGC GACGCGCGCG GCATTGGGAC ATCCGCGCGC GATTGTCCAG CATGCTTGAT
GATGGAGAGG CGTTCCTTGC CGCCCACCCG TTTGAACGCG GCCTCTGGCT GGTGGTTTCC
TTCGGCGCGG GGATCGTATG CTGGCTCGCG CTTCCCCTGT CGGTCCAGTG GACGGCGACG
CTGCTGGCCT GCGGTTCTGC CGCGCTGATC TCCCTGCTGT ACGTCGACGA ACACCGCCTT
CCATACATCC GGCTCGGCAT CGCGGGCGTG GCGATCATGC TGGCGGCCGG CATGGCAACG
GTGTGGACCA AATCCGCCCT GGTGGGCCAG CCCGGCATCG GTCGGCCGAT GGTGACCACT
GTGGTTGGAA CGGTCCTCGA CCGGCGCGAG GAACCGGCCA GGGAACGCTC GCGCCTTCTG
CTGCTGACGC GCCTGCCCGA GTTTCCCGAT CCTGTTCGCG TCAGGGTGAA CCTTCCGAAG
GACGAAGACC GGCCCTTCAT CGTGGAGGGC GCTACGCTTG CCTTGCGCGC GCGATTGATG
CCGCCCGCGC CGCCCATGCT GCCCGGTGCC TATGATTTCG CCCGCAAGGC CTGGTTCGAT
GGAATCGCGG CCACCGGGAC GGTCATGGAC GATGTCCGGC TTGTTTCGCC GTCCGGCAAC
GCCACCACCT TGCGGAAAAT CCAGAGGGCC CTGGCGGATC ACGTCCGCTC TCGTCTCGCC
GGTTCCGCAG GGGCGATAGC CGCGGCCTTT GCCAGCGGAG ACAGGGGCGC AATCGCCAAG
GCGGACGAGG ATGCGATGCG CGATGCCGGG CTTACCCATC TGCTGTCGAT CAGCGGCCTC
CATGTCAGTG CGCTGGTGGG GGCCGTTTAC TGGATTTTCG CGCGATCGCT TGCGCTGGTC
CCCTGGGTCG CGCTGCGCAT CCGCGTACCG ATCGCCGCCG CGCTGGCGGG TGCGCTTGCC
GGCATTGCCT ACACCCTCAT CACCGGGGCC GAAGTGCCCA CCATCCGTTC GTGCATAGGG
GCCCTGCTCG TGCTCGCGGC ACTTGCGCTC GGGCGTGATC CGCTCTCGCT GCGCATGGTC
GCGGTCGGTG CCCTTGTCGT CATGCTGTTC TGGCCGGAAG CCGTGTTCGG GCCCAGTTTC
CAGATGAGCT TTGCTGCGGT GATCGCCATC GTCGCCTTCC ATTCCGCCGC GCCCGTCCGC
GGATTTCTCG CCGGCGAACG ACATGGGGGC CTGGCGCGGC TTGCCCGCAA CGTCCTGCTT
CTGCTGGCAA CCGGCCTGGT CATCGAACTC GCGCTCATGC CCATCGGCCT GTTCCACTTC
CACCGGGCGG GCGTCTACGG ATCGCTGGCC AATGTCATTG CCATCCCGCT CACCACATTC
GTGACCATGC CGCTGATCGG CATCGCGCTG CTTCTCGACC TTGCCGGGCT GGGTGGACCC
GCGTGGTGGC TCGTCGAAAC GTCGCTGGAT CTTCTGCTGG CACTCGCGCA CTTCGTCGCC
GACCGGCCGG GGGCGGTCAC CATGCTGCCG CCGGGGGATC GGTGGAGCTA CCTCGTCTTT
GTCGCCGGAA TGCTGTGGCT GGCGCTCTGG ACCGGCCGCG TCAGGCTGTG GGGCCTCCTG
CCCGCATTCG CCGCAGCGCT TTCCATGGCG CTGACGCCCA CGCCCGACAT CCTCGTCACC
GGGGACGGCC GGCATGTCGC CATCGCGGGC GAGGGGGCGG AACTCCTAGT CCTGAGAGCG
GGGCGGGGAG ACTTCATCCG CGAAAATCTT CTGGAACTGG CCGGAATGGA GGGGGAGACG
CGCCCCCTCG ACGAGTGGCG TGGTGCGCGC TGCGGTGAAG ATTTCTGTGC CGCGTCGCTC
ATGCGGGGCG GACGCCACTT TGCTATTCTC ATGGCCCGCA GCCGCAACGA CGTCGACGAG
ATGGACATTG CCGCCGCCTG CGAGCGAAGC GACATCGTGA TCGCCGACCG TCGCCTGCCG
CACACCTGCC GCCCGAAGCT GCTCAAGGCA GACCGCGCGT TCCTCGCAAT GACCGGCGGC
CTCTCCATCG ACCTAAGCCG CCGCCGGACA AGGACAGTCG CCGAAACGCA AGGTCGGCAA
GGGTGGTACC GATGGTCAGA ACCATCCGCC ACTTTGTCGC CCCAACCGAG GGGTACTCAG
GCCGAACGCC GGGACATGGC TCCGGAGACG AGGCATCGGT CCATGCCAAC GACAGCGCGC
TAG
 
Protein sequence
MVSPAEVGPA AGFPAADPDA LPARRARHWD IRARLSSMLD DGEAFLAAHP FERGLWLVVS 
FGAGIVCWLA LPLSVQWTAT LLACGSAALI SLLYVDEHRL PYIRLGIAGV AIMLAAGMAT
VWTKSALVGQ PGIGRPMVTT VVGTVLDRRE EPARERSRLL LLTRLPEFPD PVRVRVNLPK
DEDRPFIVEG ATLALRARLM PPAPPMLPGA YDFARKAWFD GIAATGTVMD DVRLVSPSGN
ATTLRKIQRA LADHVRSRLA GSAGAIAAAF ASGDRGAIAK ADEDAMRDAG LTHLLSISGL
HVSALVGAVY WIFARSLALV PWVALRIRVP IAAALAGALA GIAYTLITGA EVPTIRSCIG
ALLVLAALAL GRDPLSLRMV AVGALVVMLF WPEAVFGPSF QMSFAAVIAI VAFHSAAPVR
GFLAGERHGG LARLARNVLL LLATGLVIEL ALMPIGLFHF HRAGVYGSLA NVIAIPLTTF
VTMPLIGIAL LLDLAGLGGP AWWLVETSLD LLLALAHFVA DRPGAVTMLP PGDRWSYLVF
VAGMLWLALW TGRVRLWGLL PAFAAALSMA LTPTPDILVT GDGRHVAIAG EGAELLVLRA
GRGDFIRENL LELAGMEGET RPLDEWRGAR CGEDFCAASL MRGGRHFAIL MARSRNDVDE
MDIAAACERS DIVIADRRLP HTCRPKLLKA DRAFLAMTGG LSIDLSRRRT RTVAETQGRQ
GWYRWSEPSA TLSPQPRGTQ AERRDMAPET RHRSMPTTAR