Gene Gdia_0563 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0563 
Symbol 
ID6973960 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp626865 
End bp628619 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content42% 
IMG OID643390095 
ProductTerminase 
Protein accessionYP_002274971 
Protein GI209542742 
COG category[R] General function prediction only 
COG ID[COG4626] Phage terminase-like protein, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.126065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000000000223198 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAATATG AGGAAATCTA TAATAATTTA GATCCACTAT CAAAACAGCA ATACTCTTAT 
CCATATAAGG TTCTAAATGG AACTCTTGGT ACAGAGGTAT GTGAACTCAC CAGACTCGCA
TGTGAAAGAT CATTCAGGGA TTTTGAACTT CCTGATTTCT ATTATGATCC AACCGATCCA
GAACGGTTCC GTTTTCTGAC GACAAAGCTG GTCTATCTTG CTGGTGTAGG AAGTGTCGGC
GGTACGCATG TACAAATCGA AGACTGGCAG TTGTGGTATT TCTCGCAACT TCTTGGATGG
AAAAATGCGA ATGGCAGAAA CAAGGGAAAA CGTAGATTCA GAAAAGGATC ATTGTGGGTT
GCTCGAGGAA ATAACAAAAC CGGTGCCGGA GCTTGGTTCT GCCTTTATAT GTTGCTCTGC
GATGGAGAAT GGGACCCGCA GGTATATTCA GTCGGGGTAG ATCGCGATCA GGCGTCTCAT
ACATTCAAAG CTGCAGCAAA TCAGATCGAG ACGAACCCAA AGTTATTCAA GGCATTGGGG
GCACAAGTCT ATTCCAAATT CATAAAGGGA ATACAAAACT ACAAAGGCGA AAATAAACTC
GGTGTTTTCA AAGCACTTGC TCGAGGTGCT TCAAAGATGA ATGGACTGAA TATCCATTTC
GCTTTCCTTG ACGAGCTACA CGCTTATCCC GATCGATTAA CCTATGATGT CATCGTGTCA
GGTGCCAAAA AGCGAGATCA GTCTCTGGTT CTGGCCGCAT CAACTGCCGG TCTGAATCTC
GATAGTTTCG GATATGAAGA TTATTGTTAT GCCCGGTCAG TTCTACGTCA GGAGTTTGAT
CAGCAGGATG AGCAGTTCTT CTCCTGTGTA TGGGAAGCTG ATGAAGGAGA TGATCCTTAC
TCCGAGGCGA CCTGGGAGAA AGCCAATCCA TGCTGGAATT CTGCCATTGA TCAATTGTCC
TTCAGAGCAG AGGCAGCTTC AGCCAAGAGG ATTCCATCCA AGCGAAGAGA ATTCTTCACC
AAAAACCTGA ACCAATGGCT CTCGTCCGGA ACCACCTGGC TAGATATGGA TGCGGTCAAG
GCGTGCTATG ATCCAGACAT AGAGGAAGAT GATGATTATG ATTTCGGAAT TACCGGGATA
GATCTTGGCT CAAGATCGGA TCTTTGTGTT TATACAAACG TATTCGTAAA CACAATCGAT
GAGCAACTAC ATTATTATGT TTTTCCTCAT CCTTATACAT CAGAAGGATT CTTGGAGAAG
AATATTAGTT CTCAGTTTCG GGCATGGCAG AACGATGGAT GGCTTACTGT CCATAAAGGA
AATGCGGTGT CATCAATCAG TTTCCAAAAG GATCTTTTGG AAAATTACGA GAACCTAGAT
ATCCTCGAGT ATGCCTTCGA TAGAAATCAG GCGAATTATA CTATGGAAAC CTGCTCTGAA
GAAGGTATTG AAGTCATATC CATAGGACAA AATGCCGAAA CATTATCGGA AGCAACATCA
GAGTTTGAAA TAGCAATTCT TGAAAATAGA ATACATTTCA AAAATCCCAT GTTCCTACAT
CATTGTGCCA ATAGTCATAT ATTCACAACG ATTGACGGAT ATATGAAGCC AATAAAAGAA
TCGCGGAATT CAAACAATAA AATCGACATC GTAGCTTCAA CTGTCAATGC CATCGCCAGA
TGCCTATGGA ACCAAAGCAA TCAGGTTATG GCCCCAGGTG TTATTGCGTC CATTCAAATT
TTAAATAAGA GATAA
 
Protein sequence
MKYEEIYNNL DPLSKQQYSY PYKVLNGTLG TEVCELTRLA CERSFRDFEL PDFYYDPTDP 
ERFRFLTTKL VYLAGVGSVG GTHVQIEDWQ LWYFSQLLGW KNANGRNKGK RRFRKGSLWV
ARGNNKTGAG AWFCLYMLLC DGEWDPQVYS VGVDRDQASH TFKAAANQIE TNPKLFKALG
AQVYSKFIKG IQNYKGENKL GVFKALARGA SKMNGLNIHF AFLDELHAYP DRLTYDVIVS
GAKKRDQSLV LAASTAGLNL DSFGYEDYCY ARSVLRQEFD QQDEQFFSCV WEADEGDDPY
SEATWEKANP CWNSAIDQLS FRAEAASAKR IPSKRREFFT KNLNQWLSSG TTWLDMDAVK
ACYDPDIEED DDYDFGITGI DLGSRSDLCV YTNVFVNTID EQLHYYVFPH PYTSEGFLEK
NISSQFRAWQ NDGWLTVHKG NAVSSISFQK DLLENYENLD ILEYAFDRNQ ANYTMETCSE
EGIEVISIGQ NAETLSEATS EFEIAILENR IHFKNPMFLH HCANSHIFTT IDGYMKPIKE
SRNSNNKIDI VASTVNAIAR CLWNQSNQVM APGVIASIQI LNKR