Gene NATL1_01971 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNATL1_01971 
Symbol 
ID4779127 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. NATL1A 
KingdomBacteria 
Replicon accessionNC_008819 
Strand
Start bp182340 
End bp183911 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content33% 
IMG OID640083461 
Producthypothetical protein 
Protein accessionYP_001014026 
Protein GI124024910 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGATATTGA CCGAGAGCAT TTCCCACAAC TGTAGGCCTC CAATTTTATG GATTTTTCTT 
TTTTGGACCA TAGCTTGTGG AATTGCATTT GTCTCGCTGG GTAATTTGCC TTTAAGAGAT
TTTGATGAAG CAACTGTCGC AAGAGTTGCA TTGGAATTAA ACCAAAAAAG TGGACTAGAA
CGATTGCTTC CTTCTATCTG GGATAAGCCC TATTTGAATA AACCCCCAGG GTTGCATTGG
ATAATATCTT TTGCAATTGG AATTAGTAGA AGTTTCCAAA ATAATTTTGA TTTTTTACCC
TCTGAGTTTT GTATAAGATT TTTCCCAGCA CTATTTTCGA CTTTTGTTGT TCCATTGGGT
GGTCTTATTC AGTGGAACTT GCGCCCAAAA GATCGAATAG CATGCATAAC TACATCAGCA
ATTTTATTGA CTTTGTTGCC AATTATTAGA TATGGGCGAA TGGCAATGTT GGATGGAACG
CAGCTTAGTG CTATTGCACT TTTATGGTTT TGCTTATCAT CTATAAAAAA TAATAGGCCA
ACTAATTTTA ATTTTTTAGG AGCTGGATTT ACATGTAGTT TCATGCTTTT ACTTAAAGCC
CCTGTAATTA TTCCTGCACT ATTTGCATCT TTATTACCTT TAATTTGGGA ATATAATTCA
AAAAAATATT TTAATAATTT TTCATGGTCT TGGTTCTTCT ATGGACTAAT TCCAGGTTTT
GCTTGGCATG TATGGAATTT CATTTCATAT GGTTCAGGAG CTTTTTGGTT GTGGTGGGGA
GATGGGGCAG GAAGAGTTTT ATTTGAAAAA GGCTCAGGTA GCGAGCTAGG AGTTTTGGTA
CCAATAATTG AAATACTTGA AGGGGGATGG CCTTGGATTC TTCTATGGCC AATTGGTTTT
TTGTGGGCAT GCTTGAGCCT TAATACTCGT TGGGGAGTTT GGGCTTTAAG TACTCAGATT
ATTATTTTAG GAAGTATTTT ACCTCTAAAA ATGCAACTTC CTTGGTATAT TCATCCATTT
TGGTTGCCCT TTGCTTTGGT ATGCGGACCT CCAGTTTCTT GGTTAATTCA AAGAGAAGAG
AACGGTTATA TTTTCGCTAG AAAAATTTTA AGAAAAATCC CATATATATT TTCTTTAATT
GGACTATGCA TATTGACTTT TTCTTTATTA CTTAAGTTAA ATATTTTCAA CATTGGAGAA
GGTTACTTTT ATTCAATTTT CTTTATAAGT TTAGCTTGGT TTATTGGGGG ATTATTATTA
TCTAATTCAA GAAAGAATAT TAGAAAAATT GGTTTTATTG GATTGATTGT TGGAAGCATA
ATAGGCTTAT TCTTTTTTGT GAGTTCAAAA TTTTGGTTAT GGGAAATAAA TGAAAATTGG
GATGTAAGAC CTGTAGCTGA ATTTATAGAT GGCTTTCCTA ATCAACAAAT TTTTATCAAA
AATAGCTTTG AGAGACCAAG TTTAAATTGG TATGCAGGAA AACAAATCAA AAGTTTTGAC
GAGGAAGATA AAACTAAATG CAAAGTAATT AAGAAAACTA ATGCTTGGGA TCTCTATACA
TGTAATGATT AA
 
Protein sequence
MILTESISHN CRPPILWIFL FWTIACGIAF VSLGNLPLRD FDEATVARVA LELNQKSGLE 
RLLPSIWDKP YLNKPPGLHW IISFAIGISR SFQNNFDFLP SEFCIRFFPA LFSTFVVPLG
GLIQWNLRPK DRIACITTSA ILLTLLPIIR YGRMAMLDGT QLSAIALLWF CLSSIKNNRP
TNFNFLGAGF TCSFMLLLKA PVIIPALFAS LLPLIWEYNS KKYFNNFSWS WFFYGLIPGF
AWHVWNFISY GSGAFWLWWG DGAGRVLFEK GSGSELGVLV PIIEILEGGW PWILLWPIGF
LWACLSLNTR WGVWALSTQI IILGSILPLK MQLPWYIHPF WLPFALVCGP PVSWLIQREE
NGYIFARKIL RKIPYIFSLI GLCILTFSLL LKLNIFNIGE GYFYSIFFIS LAWFIGGLLL
SNSRKNIRKI GFIGLIVGSI IGLFFFVSSK FWLWEINENW DVRPVAEFID GFPNQQIFIK
NSFERPSLNW YAGKQIKSFD EEDKTKCKVI KKTNAWDLYT CND