Gene P9303_26361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_26361 
Symbol 
ID4777439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2325052 
End bp2326692 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content57% 
IMG OID640088158 
Product4-amino-4-deoxy-L-arabinose transferase 
Protein accessionYP_001018631 
Protein GI124024324 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.874739 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCTCTTGC CATCAGCAGA TCACAGGCCA TCCATTCGCA GAGCACCTCT GCTGGGTTTG 
CTCTTGCTTT GGCTGGTGGC ATGCGTGCTT GCAATCGTGG GGCTAGGAGA TCTACCCCTA
CGCGATTTTG ATGAAGGCAT CGTTGCCAGA GTGGCCTTTG AGTTAAGCCA AAAACAGGGC
CCCGAGGCTC TACTGCCCAC CCTGTGGGAT TCGCCCTACC TCAACAAACC ACCAGGCCTG
CATTGGCTGA TTGCTGCCGC CGTCCAACTC AACAACAACG GCGAAGCCTC TTTAACTCGG
CTCCCCTCAA ACACTCTGGT GAGGCTGGCG CCGGCATTGC TATCCACGCT GATTGTTCCG
TTAGGGGGGC TCGTGCAATG GCATCTAAGA CCAAATGATC GAACGAGCTG CCTTGCTACA
GCTGCAATTT TGCTAACGCT GATGCCAGTG GTACGACATG GTCGGCTAGC CATGCTCGAT
GGTCCACAAC TCAGTGCCAT GGCCTTGTTG TGGTTACTCG TTTTAAGCCT CGACCGCAGC
CCAATGGACC GCTGGCGAAC GCTGGGGGCA GGACTCATCA GCAGTGGCAT GCTTTTGCTC
AAAGCCCCTC TGCTGCTTCC AGCAGCAGCA GCAGCATTGA TTCCTATGCT TTGGGGTGGC
GAATTCAGGC GATGGTGGCG ATGGCCACTA GCAGGCTGGT TTGGAGTAGG GCTGATCCCT
GGCCTCGCCT GGCACCTCTG GCATGGCTTA CAAAGGGGCA CTGGGGCGCT GTGGCTCTGG
GGAGGCGATG GTGCAGGCCG CGTACTTTTT GATGCTGGAG AAGGCAGTGA CCTCGGCTGG
AAAGTCCCAG TAATTGAGAT GCTCGAGGGT GGTTGGCCCT GGCTGTTGCT ATGGCCATTC
GCAATGGCCT GTGCCTGGCG TCAACGACAC AGCCGTTGGG GCAAATGGGC ACTGGGCACA
CAAGCCATAC TAGCTATTGC AATCTTGCCA CTGAAGACCC AACTCCCCTG GTATAGCCAC
CCTCTATGGC TCCCATTTGC ACTGCTCTGT GGAGCAGCAC TGGCCTGGCT CATTCACAGG
AAGGATCTAA AAAATCCTCC TGGTGCAGGA GTTTTAAAAC ATGTTCCGTA CCTCTGGCTG
GCCCTAGGAG TCACCCTTGT TCTATTCGGG CTCATCGGCG CATCAGGGCA ATTTATAACG
TTGCAGCCCT ATAGCGGCAT TGCACTCGCT GCTGGGGTTG GCTGGAGTAT CGGCGGCTGG
CTGATGCTGC GCCCAACCCA TGCCAAACGC AAATTGGGCG CCATCAGCAT GGTTGCTGGA
AGCGTCGCAG CTCTATACCT ATTAATGAGC TCCTCACTCT GGCTCTGGGA ACTCAATGAG
AACTGGCCCG TTGAACCTGT CGCCCAGCTC GCCGCACAAG CAAAGGGAGC GAAGGTGGTG
CTCGAGGGGA ATGATGAAAG GCCCAGCCTC AACTGGTATG CAGGCCAGCG CATCAGCTCC
TTAGATGCTG TTCCAGACGC TGAATGGATC TTGACGAGAA ATCCCCAGCG AATCAGCAGC
ATGGCTCAGG AGCGGCAGTG CAAGCTTGCG CAAAGCAAGG AAGACTGGGC CCTACTTTTT
TGTGGCCCGC AAACCCAATA A
 
Protein sequence
MLLPSADHRP SIRRAPLLGL LLLWLVACVL AIVGLGDLPL RDFDEGIVAR VAFELSQKQG 
PEALLPTLWD SPYLNKPPGL HWLIAAAVQL NNNGEASLTR LPSNTLVRLA PALLSTLIVP
LGGLVQWHLR PNDRTSCLAT AAILLTLMPV VRHGRLAMLD GPQLSAMALL WLLVLSLDRS
PMDRWRTLGA GLISSGMLLL KAPLLLPAAA AALIPMLWGG EFRRWWRWPL AGWFGVGLIP
GLAWHLWHGL QRGTGALWLW GGDGAGRVLF DAGEGSDLGW KVPVIEMLEG GWPWLLLWPF
AMACAWRQRH SRWGKWALGT QAILAIAILP LKTQLPWYSH PLWLPFALLC GAALAWLIHR
KDLKNPPGAG VLKHVPYLWL ALGVTLVLFG LIGASGQFIT LQPYSGIALA AGVGWSIGGW
LMLRPTHAKR KLGAISMVAG SVAALYLLMS SSLWLWELNE NWPVEPVAQL AAQAKGAKVV
LEGNDERPSL NWYAGQRISS LDAVPDAEWI LTRNPQRISS MAQERQCKLA QSKEDWALLF
CGPQTQ