Gene Cwoe_3812 details


Gene Information

Locus tag: Cwoe_3812
Symbol:
ID: 8734267
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Conexibacter woesei DSM 14684
Kingdom: Bacteria
Replicon accession: NC_013739
Strand:
Start bp: 4049829
End bp: 4050926
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 69%
IMG OID: 646504434
Product: NMT1/THI5 like domain protein
Protein accession: YP_003395604
Protein GI: 284045264
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0715] ABC-type nitrate/sulfonate/bicarbonate transport systems, periplasmic components
TIGRFAM ID:
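
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4049829, 4050926)
print(length, protein_length(length))
```

For the Start bp and End bp values listed here, this gives 1098 bp and 365 aa, matching the Gene Length and Protein Length fields.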


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.424332
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGCTG CCCGTCGCCT GCGGTCCCTC CGCCTGCTTG CCCTGGCCGT CTTCGCGCTG 
GTCGGCCTGG TCGTCGTGTC TGGCTGCGGC AGCAGCGATG ACGACTCTGG TGGCGGCCAG
ACGACCGCGG CTGCTGGAGG CAGCGGCGGC GGCGAGACGT CGAAGATCAA GCTCCAGTAC
GGCTGGACGG TCGACGAAGG GTTGATCGGC GAGGTCGTCG CGATCGAGGA GGGCTTCTTC
GAGGCCGAAG GGCTCGACGT CGAGATAGTC CCCGGCGGCC CGAACAACGA CGGCGTCGCC
TCGGTCGCCT CCGGACGAGC CCAGATCGGC GTCGCGTCCG AGAGCCCGCC GGTGATGCTC
GCCGCCTCCC AGGGGATCCC GGTGCAGGCG TTCGCCGCGC AGCTCCAGTC GCACCCGTAC
GCCTACTTCG CGCTGCCGGA CACGCAGCTC GACTCGCCCG AGGACCTGAA GGGCAAGTCG
GTCGGCGTGC CGCCGCCGGC CGTCGGCATG CTCGACGCGT ACCTGAAGGA CAACGGCATG
ACGAAGGACG ACCTCGACAG CGTCAAGTCG GTCTCCTTCG ACGTCGCGCC GCTGCTGCAG
AGACGCGTCG ACGTGTGGGG CGGCTGGCTG ACCGACCGCG CGCAGCTGAG ACTGCTGCCG
GAGGGCTACA GAGTTCTGCC GTACGCCGAG AGCGTCCCGC TCTACGGCGG CACCTACTAC
GCCAACCCGA GATTCCTCGC CGGCGACAGA GACAAGGCCG AGGCGTTCCT GCGCGCGGTC
GCCAGAGGCT GGGCGTTCGC GAAGAGAGAC CCGGAGGCGG CGGCGAGAAT ATTCGTCGAG
GCGTACCCCA ACTCCGAGGG CAAGTCGACG ATCGAGTCGA TCGTTGAGGC GCAGGAGACG
CTGTTCCCGT TCATGTGGAC GGAGACGAGC GAGACCGGCG GCTACGGCGC GATGGACCCG
GCGGCCTGGC AGGAGCAGCT CGACCTGTGG GAGCAGACCG GCCAGTTCGA CAGAGGTGAC
GTCCCGACGG TCGAGGAGGT CATGACGACC GACATCCTCG ACGCCACCAG AGATGACCGC
ACTGCACTCG CGAAGTGA
 
Protein sequence
MRAARRLRSL RLLALAVFAL VGLVVVSGCG SSDDDSGGGQ TTAAAGGSGG GETSKIKLQY 
GWTVDEGLIG EVVAIEEGFF EAEGLDVEIV PGGPNNDGVA SVASGRAQIG VASESPPVML
AASQGIPVQA FAAQLQSHPY AYFALPDTQL DSPEDLKGKS VGVPPPAVGM LDAYLKDNGM
TKDDLDSVKS VSFDVAPLLQ RRVDVWGGWL TDRAQLRLLP EGYRVLPYAE SVPLYGGTYY
ANPRFLAGDR DKAEAFLRAV ARGWAFAKRD PEAAARIFVE AYPNSEGKST IESIVEAQET
LFPFMWTETS ETGGYGAMDP AAWQEQLDLW EQTGQFDRGD VPTVEEVMTT DILDATRDDR
TALAK
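
The protein sequence can be reproduced from the gene sequence using translation table 11, whose codon-to-amino-acid assignments are the same as the standard code (it differs only in the permitted start codons). A minimal sketch follows. The helper names `clean`, `translate`, and `gc_content` are hypothetical and not taken from any library, and the input is assumed to be the space-grouped text shown in this record.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (the NCBI table layout).
# Table 11 shares these assignments with the standard genetic code.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


print(translate("ATGAGAGCTG CCCGTCGCCT GCGGTCCCTC"))  # first 30 bp of the gene
print(translate("ACTGCACTCG CGAAGTGA"))               # last 18 bp of the gene
```

The two calls print `MRAARRLRSL` and `TALAK`, which are the first ten and last five residues of the protein sequence in this record. Passing the full gene sequence to `gc_content` gives a value to compare against the GC content field.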