Gene Shewana3_1402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewana3_1402 
Symbol 
ID4478999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. ANA-3 
KingdomBacteria 
Replicon accessionNC_008577 
Strand
Start bp1617429 
End bp1618718 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content50% 
IMG OID639725957 
Producthypothetical protein 
Protein accessionYP_869042 
Protein GI117919850 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.769269 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCACT TACCCACTAG CTTGCTGCTG TCGACACTGG CTGTCGCGCT GTCAGCCCAG 
GCACATAATC TCGTCCCTGC GGCCAAGCAA ACCCAGAGCG TGTTGATTAA AAATGCCACA
GTGCATACCG TCAGCCAAGG GACATTAACC AATACCGATG TGTTGATTGA GCAAGGTAAA
ATTAGCGCCA TCGGGCCACA GCTAAGCGTC AATGGCGCTG ATAGCCAAGC GCAGGTCATC
GATGCCACGG GTAAACACTT ATACCCCGGC CTTATCGCAC TCGATACCAG TCTCGGTTTA
GTCGAAATCG AGATGGTGCG CCCAACCGTC GATAATGCAG AAGTCGGTGA TTTTAATCCG
CAAATTAGCG CGGCGACGGC TTACAACCCA GATTCAGAGT TATTGCCGAC CATTCGTTAC
AATGGCATCA CCCACGCCCA AATCGTCCCC AGTGGGGATG GATTAGCCGG CCAATCTGTG
ATCGTCGACC TCGATGCTTG GACGATTGAG GACGCCCTGC AACCAAGCGA AGGCCAATTC
CATGTTTACT GGCCACAAAT CAAGCGCATG CCTGAAGATG AAAAGGAAAA AGCCAAGGCC
ATAGAGAAAA ATCAGCAGGC CATAGATAAA CTGACCACAG CCTTTGAGGA TGGCTATCGC
TACTTTTTAA GTCATAAAGC AAAGGATTCG GCCAAGACAA CCAATTTACG TTGGCAGGCC
ATGTTGCCCC TATACCAAGG CAAGGCCACG CTATTTGCCC ATGCGGACAG CGTCAGCCAA
ATTGAGCAAG TCATTGCCCT GACTAAAAAA TATCAATTTA AATTGGTGAT TGTCGGAGGC
TATGATGCGT GGCGATTAGC TTCAAGCTTG AGGGAAGTTA ACGCCAGTGT GATTTACCCG
CACACCCTGA GTCTGCCCAA ACGAAAAGAT GAACCTGTGG ATTTACCCTT TAAAATCCCT
TCGTTACTGG CAAGTGCAGG CATTCCCTAT GCCCTTGGAT TTTCATCGGA TTGGAACAGT
CGTAACCTTC CCTATGCCGC GGGCTACAGT GCCGCTTACG GCGTCACGCC CGAGCAAGCA
TTAAAATCGG TGACCTTAGA TGCAGCAAAA CTGCTGGGGA TCACCGATTT AGGCGCCATA
GCCATCGGTT ATCAGGGCAG TGTTGTGCTG AGTGACGGGG ATATCCTCGA CCCTATAAGC
AACAAAATTA ATGCGATCTG GATTGAAGGA CGGCAAATAG ATCTGAATAA TCGCCACCAG
CAGCTTTATC AAAAGTATCT TAAGCGCTAG
 
Protein sequence
MKHLPTSLLL STLAVALSAQ AHNLVPAAKQ TQSVLIKNAT VHTVSQGTLT NTDVLIEQGK 
ISAIGPQLSV NGADSQAQVI DATGKHLYPG LIALDTSLGL VEIEMVRPTV DNAEVGDFNP
QISAATAYNP DSELLPTIRY NGITHAQIVP SGDGLAGQSV IVDLDAWTIE DALQPSEGQF
HVYWPQIKRM PEDEKEKAKA IEKNQQAIDK LTTAFEDGYR YFLSHKAKDS AKTTNLRWQA
MLPLYQGKAT LFAHADSVSQ IEQVIALTKK YQFKLVIVGG YDAWRLASSL REVNASVIYP
HTLSLPKRKD EPVDLPFKIP SLLASAGIPY ALGFSSDWNS RNLPYAAGYS AAYGVTPEQA
LKSVTLDAAK LLGITDLGAI AIGYQGSVVL SDGDILDPIS NKINAIWIEG RQIDLNNRHQ
QLYQKYLKR