Gene Shewana3_1980 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_1980 
Symbol 
ID4476344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp2358326 
End bp2359486 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content48% 
IMG OID639726562 
Producttetratricopeptide repeat protein 
Protein accessionYP_869617 
Protein GI117920425 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000007627 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000000000727651 
Fosmid HitchhikerNo 
Fosmid clonabilityunclonable 
 

Sequence

Gene sequence
ATGCTTGAGA TCCTCTTCCT GCTGCTTCCT ATTGCTGCCG GTTACGGTTG GTATATGGGG 
CGGCGGAGCA TAAGGCAAAA CCAGAGTAAT CAGCGTAAGC AATTAAGTCG TGATTATTTC
ACCGGCTTGA ATTTCCTGTT GTCAAACGAG TCAGACAAAG CGGTCGACTT GTTTATCAGT
ATGCTCGATG TGGACGATGA AACCATCGAT ACTCATCTTT CCCTCGGGTC GTTATTCCGC
AAACGCGGTG AAGTTGACCG CTCCATTCGT ATCCATCAAA ACTTAATCGC ACGTCCCACA
CTCACCAATG AGCAGCGCGA TATCGCGATG ATGGAACTGG GTAAAGATTA CCTTGCGGCG
GGGTTTTACG ACCGTGCAGA AGAAATTTTC CTGAATTTGG TGAGTCAAGA TGACCATAGC
GAAGAGTCTG AGACGCAGCT GATTGCCATT TATCAAGTCA TTAAGGAATG GCAAAAGGCC
ATCGACATCA CCAAGCGCTT AAGCCGCAAG CGTCAGCAGG TGCTTAAACC GATTATCGCC
CATTTTTATT GCCAGCTTGC GGATGAAACC AGCGACGATG CTGACAAGAT CAAGTTATTA
CTACAGGCCT TAAAACAAGA TCCTAAGTGT GGCCGAGCCT TACTCACGCT CGCCAAAAAA
TTCCTCGACG CCAAGGATTA TAACCAGTGC AAATCTATGC TGATGGCACT GAAAAAGGCT
GACATAGAAC TTTTTGCCGA TGCCTTACCA ACGGCCAAGC AAGTGTATCG CGATACCCAA
GATAAAGAGG GCTATCAAGA ATTATTAGCG GGCGCGATGG CCGAAGGGGC GGGCGCCTCC
GTGGTGGTAG CGCTTGCTCA ACATATGATT AGTCTCGATG AGATAAAAAC TGCTGAAAAC
ATGGTGTTGG ATGCCCTGTA TCGCCATCCC ACCATGAAGG GGTTTCAGCA CTTAATGCAG
ATGCACCTGC GTCAAGCTGA AGAAGGGCAA GCCAAACAAA GTTTGACTAT GCTTGAGCAA
CTTGTTGAAC AACAAATTAA ATTCCGTCCA AGTTATCGCT GTAAAGAATG CGGTTTCCCA
TCCCACACCC TGTATTGGCA TTGCCCCTCC TGTAAAAAAT GGGGCACCAT TAAACGGATC
CGTGGCTTAG ACGGTGAATA A
 
Protein sequence
MLEILFLLLP IAAGYGWYMG RRSIRQNQSN QRKQLSRDYF TGLNFLLSNE SDKAVDLFIS 
MLDVDDETID THLSLGSLFR KRGEVDRSIR IHQNLIARPT LTNEQRDIAM MELGKDYLAA
GFYDRAEEIF LNLVSQDDHS EESETQLIAI YQVIKEWQKA IDITKRLSRK RQQVLKPIIA
HFYCQLADET SDDADKIKLL LQALKQDPKC GRALLTLAKK FLDAKDYNQC KSMLMALKKA
DIELFADALP TAKQVYRDTQ DKEGYQELLA GAMAEGAGAS VVVALAQHMI SLDEIKTAEN
MVLDALYRHP TMKGFQHLMQ MHLRQAEEGQ AKQSLTMLEQ LVEQQIKFRP SYRCKECGFP
SHTLYWHCPS CKKWGTIKRI RGLDGE