Gene Gdia_0555 details

Gene Information

Locus tag: Gdia_0555
Symbol:
ID: 6973952
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 611462
End bp: 612808
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 41%
IMG OID: 643390088
Product: phage portal protein, HK97 family
Protein accession: YP_002274964
Protein GI: 209542735
COG category: [S] Function unknown
COG ID: [COG4695] Phage-related protein
TIGRFAM ID: [TIGR01537] phage portal protein, HK97 family
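
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names `span_length` and `coding_codons` are hypothetical helpers introduced here.

```python
# Values copied from the record above.
START_BP = 611462
END_BP = 612808
GENE_LENGTH_BP = 1347
PROTEIN_LENGTH_AA = 448


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 612808 - 611462 + 1 gives 1347 bp, and 1347 / 3 gives 449 codons, of which 448 encode amino acids.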


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.0158159
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.0015106
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAATTTC AAACAACACA ATTAGACGGA ATTCTAAGCA GCTTTAAAAG CGCGATTAAC 
AGTTATGTTG AACCTCCAAT AAATCCCGGC ATTTCTGGCT TTATCGGAAT GATGCCAACG
GCTTCTGGTA TCTCGATCAA TGTTGAGAAC ATGCTGCAAT CATCAGCAGT CTCGTCCTGT
CTAAGGATAA TAATCGACGA TATCGGAAAG CTTGATATAA ACCTCGAGAT AAAAAATAAA
CGAGGTGGAT GGGTGGTTGA TGAAACACCA GACACATTGA GGCTTCTTCG ATATCCGAAT
CAAGAAAATG TTATGCAGAC TTTGCTTAAA GCTCTGGTCT TTGATTATCT GACATATGGT
AATGCGTATG TTGTCTGTAT TCGAAATCCT GATGGTTCTA CCCAAAAACT TGTTCACGTC
AAAGCTAATG GCGTCACGGT AAGAAGAAAT GAAAAGGGCG AGCTTAGATA TACGGCATCG
TCTAAAATGT TCATTGGTGA AAGAACCTCT GTAAGATCAG AAGGACCCGA GCAAGAGACT
AGATCCATTA GTAATGAAGA CATGATCCAT ATTAAGGATT TCTCAGTCTA TGGTGGGGCA
GTAGGAACGT CTATTGTTGA CTATGCCAGA GAGGTTTTTG GGTTGACCAT TGCTGCTCAA
GAAACAGCAG CACGGACCTT CAACAATGGT GGGGCTCTAC AAGGGTACTG GAAAACCACA
AACGCGAAGG CCGGGAAGAA AACAACTGAG AACGTCCAAG CCGATCTTCA GAATATCATC
GGAGGTGTTT CAAACTCGGG AAAAATCTCG GTTATCAACG ATATGGATTA TGTACCCACG
AGTATGTCAC CACAGGCTCT TCAACTGATT GAAGCTCGGA ATCAGATGAC TCTAGAGATT
GCTCGTCTGT TTCGTGTTCC ACTCCACAAG CTGGGGATGT CTGAAACCGA CAAGGCTGCA
AACATCGAAC AGCAGGAAAT TTCTTACATA AACGATACAT TAAAAGCTAT TACGAACAAC
ATTGAGTCTT GCTTCAATAG AAAAATATTA TTGGAACGTG ATATAGGAAT AAAAAGGTTT
AAATTTGATT TTGAATCTAT GACTTGTCCT GATCTTCAAA CAAGATCTTT GGCTTATACG
ACATTAGTTG ACCACGGGCT CATGACACCT GGATTTGCGG CAAATAAGCT TGGGATTCCT
GTTCCAGAAG ACGAGGAATA TGCTGATTCA TATCGCGTTC CTCTTAATAT GGGAACGATT
GGTGCGAATG AGGATATTCC GTTGTCATTA CAGACACCTG AACCTGAAAA AAAGGTCAGA
AAGAGAACAC AAAAAAAGGA GGCATAA
 
Protein sequence
MKFQTTQLDG ILSSFKSAIN SYVEPPINPG ISGFIGMMPT ASGISINVEN MLQSSAVSSC 
LRIIIDDIGK LDINLEIKNK RGGWVVDETP DTLRLLRYPN QENVMQTLLK ALVFDYLTYG
NAYVVCIRNP DGSTQKLVHV KANGVTVRRN EKGELRYTAS SKMFIGERTS VRSEGPEQET
RSISNEDMIH IKDFSVYGGA VGTSIVDYAR EVFGLTIAAQ ETAARTFNNG GALQGYWKTT
NAKAGKKTTE NVQADLQNII GGVSNSGKIS VINDMDYVPT SMSPQALQLI EARNQMTLEI
ARLFRVPLHK LGMSETDKAA NIEQQEISYI NDTLKAITNN IESCFNRKIL LERDIGIKRF
KFDFESMTCP DLQTRSLAYT TLVDHGLMTP GFAANKLGIP VPEDEEYADS YRVPLNMGTI
GANEDIPLSL QTPEPEKKVR KRTQKKEA
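
The gene and protein sequences above can be compared by translating the nucleotide sequence codon by codon. The following is a minimal sketch, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for amino acids (table 11 differs in which start codons it permits). The `translate` function and the constant names are hypothetical and introduced only for illustration. Only the first and last lines of the gene sequence are translated here. The last line begins in frame because the 1320 bases before it are a multiple of 3.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# First 30 bases and final 27 bases of the gene sequence listed above.
first_block = "ATGAAATTTC AAACAACACA ATTAGACGGA"
last_block = "AAGAGAACAC AAAAAAAGGA GGCATAA"

print(translate(first_block))  # MKFQTTQLDG
print(translate(last_block))   # KRTQKKEA*
```

The two results match the start (`MKFQTTQLDG`) and end (`KRTQKKEA`) of the protein sequence, with the trailing `*` corresponding to the `TAA` stop codon.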