Gene Shew_1957 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShew_1957 
Symbol 
ID4920990 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella loihica PV-4 
KingdomBacteria 
Replicon accessionNC_009092 
Strand
Start bp2262067 
End bp2263227 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content53% 
IMG OID640163526 
Producttetratricopeptide repeat protein 
Protein accessionYP_001094082 
Protein GI127512885 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000379744 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000558993 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTTAGAGA TTCTGTTTCT GTTGTTACCC ATTGCCGCCG GTTACGGTTG GTACATGGGT 
CGCCGTAGTA TCAGGCATAA GCAGAATAGT AAACGTAAGC AGTTAAGCCG CGATTACTTT
ACCGGCCTTA ATTTCCTTCT GTCTAATGAA TCGGATAAGG CAGTCGATCT CTTTATCTCC
ATGCTGGACG TGGATGACGA CACCATTGAT ACCCATCTCT CCCTCGGCTC CTTGTTTCGC
AAGCGAGGCG AGGTAGACCG CTCGATTCGT ATTCACCAAA ACCTGATTGC CCGCCCAACT
CTGACAACCG AACAGCGTGA CATCGCCATG ATGGAGCTGG GCAAAGACTA TCTGGCCGCC
GGCTTCTACG ACAGAGCCGA GGAGATCTTC CTTAATCTGG TTCGCCAGGA AGATCACAGC
GAAGAGGCCG AAGATCAGCT GATCGCCATC TACCAGGTGA CCAAAGACTG GCAAAAAGCA
ATAGATATCA TCAAGAGCCT CAAGCGTAAG CGTCAGCAAT CGCTCAAACA CCTGCAGGCC
CATCTCTATT GTGAGCTTGC CGATGAGGCC AGCGACAGCG AGCTCAAGCT TAAACACCTG
GCACAGGCGA TAAAGCAAGA TCCCCAATGT GGCCGCGCCA TGTTAACCAG CGCCAAGCTG
TTCCTCGCTC AGCAGGAATT TGGCCGCGCC AAGGAGATGC TCTGCCGGTT GAAAGATGCC
GATATCGAAC TCTTTCCCGA GGCGCTCGCC ATCGCCAAAG AAGTTTATCA ATCGACCGAG
GATCTCGGCG CCTATCGTGA ACTGCTACGC GAAGCGTTAG AGCAGGGGGC TGGCGCGAGT
GTGGCCATCA CTCTGGCGCA GCAGATGATC ATTCAGGGAG AAACCCAAGA CGCCGAGAAG
TTGATTCTCG ATGGCCTCTA TCGCCATCCG ACCATGAAGA GTTTCCAGCA TCTGATGAAG
ATGCAGATCC AACACGCCGA AGATGGTCAG GCAAAACAGA GTTTGAACAT GCTCGCCGAA
CTAGTCGAGC AGCAGATAAA ATTCCGTCCC AGTTACCGCT GTATTGAGTG TGGTTTCCCG
TCCCACACCC TCTACTGGCA TTGCCCTTCC TGTAAGAGTT GGGGCACCAT CAAGCGGATC
CGCGGACTCG ACGGGGAGTA A
 
Protein sequence
MLEILFLLLP IAAGYGWYMG RRSIRHKQNS KRKQLSRDYF TGLNFLLSNE SDKAVDLFIS 
MLDVDDDTID THLSLGSLFR KRGEVDRSIR IHQNLIARPT LTTEQRDIAM MELGKDYLAA
GFYDRAEEIF LNLVRQEDHS EEAEDQLIAI YQVTKDWQKA IDIIKSLKRK RQQSLKHLQA
HLYCELADEA SDSELKLKHL AQAIKQDPQC GRAMLTSAKL FLAQQEFGRA KEMLCRLKDA
DIELFPEALA IAKEVYQSTE DLGAYRELLR EALEQGAGAS VAITLAQQMI IQGETQDAEK
LILDGLYRHP TMKSFQHLMK MQIQHAEDGQ AKQSLNMLAE LVEQQIKFRP SYRCIECGFP
SHTLYWHCPS CKSWGTIKRI RGLDGE