Gene Shewana3_3720 details

Gene Information

Locus tag: Shewana3_3720
Symbol:
ID: 4479921
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shewanella sp. ANA-3
Kingdom: Bacteria
Replicon accession: NC_008577
Strand:
Start bp: 4463658
End bp: 4464698
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 49%
IMG OID: 639728324
Product: hypothetical protein
Protein accession: YP_871344
Protein GI: 117922152
COG category: [C] Energy production and conversion
              [G] Carbohydrate transport and metabolism
COG ID: [COG1819] Glycosyl transferases, related to UDP-glucuronosyltransferase
TIGRFAM ID: [TIGR00661] conserved hypothetical protein
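
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the CDS ends with a stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(4463658, 4464698)
print(length, protein_length_aa(length))  # 1041 346
```

Both values match the Gene Length and Protein Length fields listed in the record.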


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.588173
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGAATAC TCTACGGAGT TCAAGGCACA GGGAATGGCC ACCTAAGCCG TGCTCGAGTG 
ATGGCAAAAG CCTTAATTGA GCACAATATT CAAGTCGACT TTTTGTTTTC GGGGCGTAAG
CCTGAACATT TTTTCGATAT GGAGTGTTTT GGGGAGTATC GCGTACAGGC GGGAATGACC
TTTGCAACCC ACTCTGGGCG GGTGAATGTG CCGCAAACGG TAAGACAAAA TTGCTCTTTG
TCATTGCTTA AGGATATCCA AGCATTAGAT TTGAGTTGCT ATGACTTAGT GCTGAATGAT
TTTGAACCCG TATCCGCATG GGCGGCGAGG CGTCAAGGCG TCCCTTCCAT TGGCATAAGT
CATCAAGCGG CCTTGACGCA TCCAGTGCCT AAGTTGGGAA GCACTTGGTT TAATGAGTTA
CTACTCAACT ATTTTGCGCC AGTAGATGTG GCACTGGGGT GCCATTGGCA TCATTTTGGT
TTTCCGATCC TACCTCCCTT TGTTGAAGTC GATGCCAGTC CTATTGAACA TACCCATCAA
ATTTTGGTGT ATTTACCCTT CGAAGAGGCG GATGCGATCG CCGCATTTTT TAAGCCATTT
ACGGATTATC AGTTCTTGGT GTATCACGCT AAGCAGCCGA CAACACCGCT TGCCGACCAT
ATTCAATGGC ATGGTTTTAA TCGTGACGGA TTTAAACAGC ACTTAGCGAG CTGCGGTGGG
GTGATTGGTA ATGCCGGATT TGAGCTGGCG AGCGAGGCGC TGACCTTAGG GAAAAAGTTG
TTGGTCAAGC CGCTGATTGG TCAATTTGAA CAGTTGTCGA ATGTGGCTGC GCTCCAATTA
TTGGGCGCAG GTGACAGTAT GATGAGTCTG GATACGGGCG TGGTCAAACG TTGGCTCAAG
GCGGCATCGC CAAATCCCAT CACCTATCCA CAGGTGGGCG ATGCCTTAGT GAAATGGATT
TGCAGCGGTC AGTGGCAACA TACCGCGTCA TTGTGCGATG ACCTTTGGAG TCAAGTGAAG
CTGCCCGACA CTTGGCGCTA A
 
Protein sequence
MRILYGVQGT GNGHLSRARV MAKALIEHNI QVDFLFSGRK PEHFFDMECF GEYRVQAGMT 
FATHSGRVNV PQTVRQNCSL SLLKDIQALD LSCYDLVLND FEPVSAWAAR RQGVPSIGIS
HQAALTHPVP KLGSTWFNEL LLNYFAPVDV ALGCHWHHFG FPILPPFVEV DASPIEHTHQ
ILVYLPFEEA DAIAAFFKPF TDYQFLVYHA KQPTTPLADH IQWHGFNRDG FKQHLASCGG
VIGNAGFELA SEALTLGKKL LVKPLIGQFE QLSNVAALQL LGAGDSMMSL DTGVVKRWLK
AASPNPITYP QVGDALVKWI CSGQWQHTAS LCDDLWSQVK LPDTWR
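
The relationship between the gene sequence and the protein sequence can be reproduced with a short translation sketch. This is an illustration under stated assumptions, not the pipeline used to build the record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the tables differ in permitted start codons), and the helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Codon table in TCAG order; table 11 shares these codon assignments
# with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three blocks of the gene sequence above.
print(translate("ATGCGAATAC TCTACGGAGT TCAAGGCACA"))  # MRILYGVQGT
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` allows the same comparison against the complete protein sequence and the listed GC content.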