Gene Shewmr4_3100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_3100 
Symbol 
ID4253671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3709731 
End bp3711011 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content50% 
IMG OID638119742 
Productphosphate-selective porin O and P 
Protein accessionYP_735228 
Protein GI113971435 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG3746] Phosphate-selective porin 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTAAAT TGACCTTACT TGCCGCGGCA ACCGCTGTGG CTTTTACTGG TTTTGCGCAT 
GCGCAAGACT TAACGCCACA AGAGCTTCAA GCACAACTCA CCCAATTGAC TCAAGAAGTA
AACCTGCTAA AACAACAGCA GGCAGTGAAT GCCGCCGCGA CTCCATCTGA GCCTAGTGCG
CCAGCCTTTG AGATAGGTGG TCGTATTCAA CTCGATTACA ACCTCTTCAA TGGTGCCTAT
AACGCCGAAC ATAACGGCAG TCGCGCGCAG GAAATTTTCC CCCGCCGAGT ACGTACCTTT
GTGGAAGGTG AATTATCGGA TTGGGATTAC AAGCTACTGC TTGAATTTGC TGAAAATACT
GCTGAAATCG TCATGGCTAG GATGCGTTAC TCGGGTTTTG AGAATGGCCC TAAACTGCAA
TTGGGTAAGT TGCGTGAGGA TATTAGCCTC GATGCGCTCA CCAGCAGTAA TCACATTGCG
CTGATCGAAC GTTCTAGCCT TGCCGATACT ATGTCACCCT ATTTCCGCTG GGGTGTTTCT
GCCTACCAAT ATTTCCCCAC CACGGGTTTA CGTTACGCCT TGGGTGTTTA TAAAAACGAT
GCCTTTGGTA GCGATGGCAA AGATGACACC GATGACTTGA ATTTTGCCCT GAGCTCGCGA
GTGACCTGGT CACAGGCCAC TGAACCGGGC CAAGTGCTGC ATGCGGGCTT GTGGTACTCG
GCGCGCCAAA TGGACGGCGA TAACTTGTCT GCCAGCTTTG CTCGCGGCGA ACTGCGAGAG
ACCAATACTC GCCTGATTAA CTATGTTGCG GGTGGTGAAA CGGCGGCGAT AGATCGGCTA
AATCAAGTGG GTTTAGAGCT GGCATTCCAA CATAAGGCTT TGTTATTGCA AGGCGAATAC
GCCCAGCGGA ATTTAACGAC CCAAGATCCT CTGTCTGTGC TTGATGGTGA GCGTTATGAG
GCGTATTACC TGCAGGCGAG TTACTTCTTA ACGGGCGAAC AGCGCAGCTA TTCCAAGGGG
AGCGCGGTGT TTTCTCAACC TAAGGGCGTA ACCGATGCAT GGGAATTGGC GGCACGTTTC
TCTAGCGTGG ATGCAAGCTC CGACTTTCAA GGCACTCAAG CCCAAACCTA TACCTTAGGT
GTGTCTTACT ACTTCAATCC AAAAATCAAA GTGATGGCTA ACTACATTCA TTCTTCCGTC
GATGGGGCAG GCACGATTGC CTTAGTCGGC AATGAAGATG TGGGCGATGC CTTTGCTGCC
CGTTTGCAAT ATGTGTTTTA G
 
Protein sequence
MPKLTLLAAA TAVAFTGFAH AQDLTPQELQ AQLTQLTQEV NLLKQQQAVN AAATPSEPSA 
PAFEIGGRIQ LDYNLFNGAY NAEHNGSRAQ EIFPRRVRTF VEGELSDWDY KLLLEFAENT
AEIVMARMRY SGFENGPKLQ LGKLREDISL DALTSSNHIA LIERSSLADT MSPYFRWGVS
AYQYFPTTGL RYALGVYKND AFGSDGKDDT DDLNFALSSR VTWSQATEPG QVLHAGLWYS
ARQMDGDNLS ASFARGELRE TNTRLINYVA GGETAAIDRL NQVGLELAFQ HKALLLQGEY
AQRNLTTQDP LSVLDGERYE AYYLQASYFL TGEQRSYSKG SAVFSQPKGV TDAWELAARF
SSVDASSDFQ GTQAQTYTLG VSYYFNPKIK VMANYIHSSV DGAGTIALVG NEDVGDAFAA
RLQYVF