Gene Swit_3494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSwit_3494 
Symbol 
ID5199361 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSphingomonas wittichii RW1 
KingdomBacteria 
Replicon accessionNC_009511 
Strand
Start bp3843232 
End bp3844395 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content70% 
IMG OID640583043 
Productpolysaccharide export protein 
Protein accessionYP_001263978 
Protein GI148556396 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1596] Periplasmic protein involved in polysaccharide export 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.437041 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTCCC ATCCCAGTCC CCGCCTTCGC CTCACCGGCG CCATGGCGAC CCTCTCGATA 
CTCCTGTCCG GCTGCGCGAC CCTGCCGTCG AGCGGGCCGA CCGGCCGGCA GGTGCTCAAC
GGCGCCAAGA ACCCGGAGGC CGGGCTCGGC TTCGACGTCG TCGAACTCGA CGGCGCGGCG
TTCCAGAAGC TGCAGCAGCT CCAGCCCGCC GGCCGTGCGA CGGGGCAGCT CGCCGCGCTC
GCGCGGGAAG GGCGGGTCGA CCGGATCGCG CCCGGCGACG TGCTCCAGGT CAGCATCTAC
GAAGTCGGCA TGACCCTGTT CAGCAGCGGG CGGACCGCGG GACCGGCCGG CAGCGTCGAA
CCCGACACGG CGCACGCGCA GGCGATCAAT GCGGTGACGG TCGCCAGCGA CGGCACCATC
CGCCTGCCCT ATGTCGGGCG GCTGCTGGTC GCCGGCTCGA CCCCCTATGA CGTCCAGCGG
ATGATCGAGC AGGCGCTGCA GGGCAAGTCG CAGAGCCCGC AGGCGATGGT GACGGTGGTC
AACAGCCCCG GCAACAGCGT CTACATGCTC GGCGACGTGG TGCGCACCGG GCGCATCCCG
CTCACCCCCG CCCGCGAGCG GTTGCTCGAC GCGATCGCGA CCTCGGGCGG GTCGAGGGTC
AGCGGCGCCG ATACGCTGGT CCGGCTCACC CGCGGCGGCG AACATGCCGA GATGCGGCTG
GGCGACGTCC GGCCCGGCAG CAGCGACGAT CTCACCCTGT TGCCCGGCGA CCGCATCGAG
CTGGTCAACG AGCCGCGCAG CTACAGCGCG TTCGGCGCGA CGCCGAAGGT GTCGCAGGTT
CCGTTCGGGG AACCGAACCT GTCGCTGGCC GAGGCGCTGG CGCGGATCGG CGGGCCGAAT
GACGCGCAGG CCGATCCCAA GGCGGTGTTC CTGTTCCGCT ACGATGCCGC GGCGATCGCG
GCGGGCGAGC GGCCGGTCAT CTATCGCCTG AACCTGATGA AGCCCGAAAG CTACATCATC
GCGCAGAACT TCCCGATGCA CGACAAGGAC CTGATCTACA TCGCCAACTC GGCGTCGAAT
CCGGTCACCA AGTTCGTCGC GATTCTCAAC CAGCTGTTCG CGCCGCTGCT GACCGCGAAG
GTGCTGACCG ACAACAGAAA CTGA
 
Protein sequence
MISHPSPRLR LTGAMATLSI LLSGCATLPS SGPTGRQVLN GAKNPEAGLG FDVVELDGAA 
FQKLQQLQPA GRATGQLAAL AREGRVDRIA PGDVLQVSIY EVGMTLFSSG RTAGPAGSVE
PDTAHAQAIN AVTVASDGTI RLPYVGRLLV AGSTPYDVQR MIEQALQGKS QSPQAMVTVV
NSPGNSVYML GDVVRTGRIP LTPARERLLD AIATSGGSRV SGADTLVRLT RGGEHAEMRL
GDVRPGSSDD LTLLPGDRIE LVNEPRSYSA FGATPKVSQV PFGEPNLSLA EALARIGGPN
DAQADPKAVF LFRYDAAAIA AGERPVIYRL NLMKPESYII AQNFPMHDKD LIYIANSASN
PVTKFVAILN QLFAPLLTAK VLTDNRN