Gene Gdia_3444 details


Gene Information

Locus tag: Gdia_3444
Symbol:
ID: 6976896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Gluconacetobacter diazotrophicus PAl 5
Kingdom: Bacteria
Replicon accession: NC_011365
Strand:
Start bp: 3772480
End bp: 3774189
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 61%
IMG OID: 643392965
Product: protein of unknown function DUF1078 domain protein
Protein accession: YP_002277784
Protein GI: 209545555
COG category: [N] Cell motility
COG ID: [COG1749] Flagellar hook protein FlgE
        [COG4786] Flagellar basal body rod protein
TIGRFAM ID: [TIGR03506] flagellar hook-basal body proteins
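
The start, end, gene length, and protein length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record itself does not state either convention, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3772480
END_BP = 3774189
REPORTED_GENE_LENGTH = 1710      # bp
REPORTED_PROTEIN_LENGTH = 569    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int) -> int:
    """Residue count for an unspliced CDS, assuming its last codon is a stop."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


computed_gene = gene_length(START_BP, END_BP)
computed_protein = protein_length(computed_gene)
print("gene length matches record:", computed_gene == REPORTED_GENE_LENGTH)
print("protein length matches record:", computed_protein == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions both computed values agree with the reported 1710 bp and 569 aa.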


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value: 0.76085
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGTTT TCAACGCCCT TTCGACGGCT GTCAGCGGGA TCGACGCGCA GTCGACGGCC 
TTCACGAACC TCAGTAACAA CATCGCCAAC AGCCAGACCG TCGGCTACAA GGCCGAATCG
ACGTCCTTCC AGGATTTCGT GGCCGGATCG CTGACGTCGA GCACGGCGTC CAGCGATATT
TCGGATTCGG TGGCTGCCGT CGACGTGCAG AATGTCGGCG CACAGGGGAC GGCCTCGGCC
AGCACCGATA CCCTGGCCAT GGCCATCAAC GGAAACGGGC TCTTCGACGT GTCCGAGGAG
ACAGGCCAGG CGACGTCCGG CACGACGCAG TTCGAGAATA CGCAGTACTA CACACGAAAC
GGCGAGTTTT ATGAGAACAA CGAGGGCTAC CTGGTCAACA CGACCGGCTA TTACCTTGAC
GGCTACATGG CTGACAGCAA TGGTTCCCTG GGAAACACCC TGACCCAGAT CAACGTCGCC
AACGTCTCCT TCCGCCCGAC GGAAACGACG ACCATTACGC AATCCGCCGC CGTGGGCACG
ATCCCGAGTG ATTCGACCTC GTATACAGCC CAGTCGTATT CGACATCTCC GGTCACGACG
TATGATGCCG ATGGCAACGC CTCCAAGGTC GCACTGACCT GGACCCAGAG TTCGACCAAC
CCTCTGGTCT GGACGGTCAG CGCCTATGAT GCCGGCGGCA CCGGCAAGGT TGCCTCGAAC
AGTTTTGAGG TGACGTTTGA CAGCAGCGGT GATCTGGCTT CCGTCACGGG CACCAGCGAT
GGCTCCAGTT ATACGTCTTC GACGTCCAGC GGTGCCTCGG TTGATCCGAC CATTACGCTG
ACCTCCAATG GCGTTGCGCA GACGATCCGT CTTGATCTCG GCACGATTGG CGGAACCAGC
GGCACGACGA TGGCGGCGTC CAGCGGTACG GCAAGCGCGA GTGGGGTGAC CAGCCTGTCG
GCGTCAGGGA CAGCGCTCTC GATGGCGACG ACCACGCTGG GTACGACCAC CGGATCGGGG
CAGAGCTATA TGACGGCGCC GACGGACGTC AACAGCGTGC CCGTGTCGGC CGTGTGGAGC
CAGACATCTG CCAATCCTTC GACGTGGTCG GTTTCGTTGG TCGATCCGTA TGGTGGTTCC
GACGTCAGTT CAGATACCTA CAGCGTCGTT TTCAATTCCA ATGGCACGGC GCAAACGGTT
ACTGATACGA CGACTGGCGC GACGACCACG CTGTCCAGCC TGAGCGCGAC AGTTAACGGT
AAGGCCTACA CCCTGGATGC CAGCGCGGCC TCTTTGTCCA CGACCGCGCT GACCACCAAT
ACGGCGCTGA CCAGCGACAG CGTGACCAGC GGCACCTACG AGGGGGCCGA AATCGAGAGC
GACGGCTCCG TCATGGCCGA GTTCAGCAAC GGCGACACGC AGTTGATCGG CAAGGTCGCG
CTCAGCACGT TTGCCAATGT CGATGGCCTG AATGCGGTCA CCGGCCAGGC TTATACCGCC
ACGGCGGCAT CCGGCGCGGC GCAGACGGGC ACCGTCGGGT CGAATGGAAC GGGAACGCTG
GAAGTCGGCT ATGTCGAATC CTCGACGACC GACCTGACCA GCGATCTGTC CGCCCTGATC
GTGGATCAGG AAGCGTACTC GGCCAATACC AAGGTCGTCA CGACTGCTGA TGACCTGCTC
CAGGCCACCA TCTCGATGAA GCAGGGCTGA
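
The gene sequence above is laid out in blocks of 10 bases separated by spaces, so it has to be cleaned before any calculation. The sketch below shows one way to do that and to compute a GC percentage and a basic CDS sanity check. It is illustrative only: the record does not say how its GC content figure was derived, the helper names are hypothetical, and the start-codon check is simplified to ATG even though translation table 11 permits other start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons under translation table 11


def clean_sequence(text: str) -> str:
    """Strip spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of bases that are G or C (simple fraction, an assumption)."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, it starts with ATG, and ends with a stop."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, as formatted in this record.
fragment = clean_sequence("ATGTCAGTTT TCAACGCCCT\n")
print(fragment, round(gc_percent(fragment), 1))

# Hypothetical toy CDS, not taken from this record.
print(looks_like_complete_cds("ATGAAATGA"))
```

Pasting the full gene sequence into `clean_sequence` and passing the result to the other two functions would allow the reported length and GC content to be compared with the sequence itself.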
 
Protein sequence
MSVFNALSTA VSGIDAQSTA FTNLSNNIAN SQTVGYKAES TSFQDFVAGS LTSSTASSDI 
SDSVAAVDVQ NVGAQGTASA STDTLAMAIN GNGLFDVSEE TGQATSGTTQ FENTQYYTRN
GEFYENNEGY LVNTTGYYLD GYMADSNGSL GNTLTQINVA NVSFRPTETT TITQSAAVGT
IPSDSTSYTA QSYSTSPVTT YDADGNASKV ALTWTQSSTN PLVWTVSAYD AGGTGKVASN
SFEVTFDSSG DLASVTGTSD GSSYTSSTSS GASVDPTITL TSNGVAQTIR LDLGTIGGTS
GTTMAASSGT ASASGVTSLS ASGTALSMAT TTLGTTTGSG QSYMTAPTDV NSVPVSAVWS
QTSANPSTWS VSLVDPYGGS DVSSDTYSVV FNSNGTAQTV TDTTTGATTT LSSLSATVNG
KAYTLDASAA SLSTTALTTN TALTSDSVTS GTYEGAEIES DGSVMAEFSN GDTQLIGKVA
LSTFANVDGL NAVTGQAYTA TAASGAAQTG TVGSNGTGTL EVGYVESSTT DLTSDLSALI
VDQEAYSANT KVVTTADDLL QATISMKQG