Gene Shewana3_1692 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_1692 
Symbol 
ID4477202 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp1990179 
End bp1991489 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content50% 
IMG OID639726275 
ProductXaa-His dipeptidase 
Protein accessionYP_869331 
Protein GI117920139 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATGCC CAGTGCTAAT CTTTGTCTAT CCTGAGTGTG ATGCGTTATC GAGGCCAAAA 
GTGCAAGCCG TGTTAGACAC TATTATCGAC AGTAAACAAC GTGATACCCG ACTGATGTGG
GCCCTGTGTG TGGCTTCCGT GGTGGTGTAT ATCAATCTGT ATTTGATGCA GGGCATGTTA
CCGCTGATCG CCGAGCATTT TGCGGTATCG GGCTCTAAGG CAACGCTTAT CCTTTCGGTC
ACCAGCTTTT CGCTGGCGTT TTCGCTGTTA ATTTATGCGG TTGTGTCCGA CAGAATTGGC
CGCCACACGC CGATTGTCGT GAGTCTCTGG CTACTGGCGC TGTCGAATCT GCTGTTGATT
TGGGCTGGGG ATTTTAATGC TCTTGTCTAC GTCCGCTTTT TACAGGGCGT GCTGTTAGCG
GCGGTGCCCG CCATTGCCAT GGCCTATTTT AAGGAGCAAC TCTCGCCAAG CACTATGCTC
AAAGCCGCGG GTATTTATAT CATGGCCAAC AGTATCGGCG GGATTGTCGG TCGGTTACTG
GGCGGGGTGA TGTCGCAGTT TTTATCTTGG CAAGAGTCCA TGTGGCTGCT GTTTTTAGTT
ACGCTTGCGG GCGTTGCCTT AACCAGTTAT TTATTGCCTT CTGGCGCCGA TGCGCAGGCG
GTATCGGGCG GACAAACCAC CTCGCCAACA CTGTCAAAAC GGGCACGTTT ATTACAGGAT
ATTTATGGCT TTAGCCATCA CCTAACCGAT CCACAGATGC GTTTAGCCTA TGCCATCGGT
GGGATCACTT TTATGATGAT GGTGAATCAA TTTAGCTTTA TTCAGCTGCA TTTGATGGCC
GCACCTTACG AGTGGAGCCG TTTCCAAGCG ACGTTGATCT TCCTGTGTTA TTCCAGTGGT
ACCGTGGCTT CTTATTTTAC TGCCAAGTGG CTGGCCAAAT TTGGTCAGCA CAAGTTATAT
CAATGGTCTT GGTGCTTGAT GTTACTGGGC AGTTTATTGA CCCTGTTTGA TACGCCAGTC
ACGATTAGCC TAGGCTTTTT GATGACGGCC TGTGGCTTCT TCTTAACCCA CAGCTGCTGT
AACTCCTTTG TGGCGATGCG CGCAAGCCGT GACCGCGCTA AAGCCACCTC GCTGTATTTA
TGTTGCTATT ACTTAGGCGC CGCGCTGGGC GGGCCTTACC TGATGCTATT TTGGCATAAA
GCCGAGTGGC AGGGAGTGGT GATGGGATCA TTAACTCTCC TTGCATTAAT CGCCTTAGCT
ATCGGTCGAT TGCGTTATCA CCAGACCCAG ATGAGCCGCG TGACGCTCTA G
 
Protein sequence
MLCPVLIFVY PECDALSRPK VQAVLDTIID SKQRDTRLMW ALCVASVVVY INLYLMQGML 
PLIAEHFAVS GSKATLILSV TSFSLAFSLL IYAVVSDRIG RHTPIVVSLW LLALSNLLLI
WAGDFNALVY VRFLQGVLLA AVPAIAMAYF KEQLSPSTML KAAGIYIMAN SIGGIVGRLL
GGVMSQFLSW QESMWLLFLV TLAGVALTSY LLPSGADAQA VSGGQTTSPT LSKRARLLQD
IYGFSHHLTD PQMRLAYAIG GITFMMMVNQ FSFIQLHLMA APYEWSRFQA TLIFLCYSSG
TVASYFTAKW LAKFGQHKLY QWSWCLMLLG SLLTLFDTPV TISLGFLMTA CGFFLTHSCC
NSFVAMRASR DRAKATSLYL CCYYLGAALG GPYLMLFWHK AEWQGVVMGS LTLLALIALA
IGRLRYHQTQ MSRVTL