Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Gdia_3444 |
Symbol | |
ID | 6976896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Gluconacetobacter diazotrophicus PAl 5 |
Kingdom | Bacteria |
Replicon accession | NC_011365 |
Strand | + |
Start bp | 3772480 |
End bp | 3774189 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 643392965 |
Product | protein of unknown function DUF1078 domain protein |
Protein accession | YP_002277784 |
Protein GI | 209545555 |
COG category | [N] Cell motility |
COG ID | [COG1749] Flagellar hook protein FlgE [COG4786] Flagellar basal body rod protein |
TIGRFAM ID | [TIGR03506] fagellar hook-basal body proteins |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.76085 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCAGTTT TCAACGCCCT TTCGACGGCT GTCAGCGGGA TCGACGCGCA GTCGACGGCC TTCACGAACC TCAGTAACAA CATCGCCAAC AGCCAGACCG TCGGCTACAA GGCCGAATCG ACGTCCTTCC AGGATTTCGT GGCCGGATCG CTGACGTCGA GCACGGCGTC CAGCGATATT TCGGATTCGG TGGCTGCCGT CGACGTGCAG AATGTCGGCG CACAGGGGAC GGCCTCGGCC AGCACCGATA CCCTGGCCAT GGCCATCAAC GGAAACGGGC TCTTCGACGT GTCCGAGGAG ACAGGCCAGG CGACGTCCGG CACGACGCAG TTCGAGAATA CGCAGTACTA CACACGAAAC GGCGAGTTTT ATGAGAACAA CGAGGGCTAC CTGGTCAACA CGACCGGCTA TTACCTTGAC GGCTACATGG CTGACAGCAA TGGTTCCCTG GGAAACACCC TGACCCAGAT CAACGTCGCC AACGTCTCCT TCCGCCCGAC GGAAACGACG ACCATTACGC AATCCGCCGC CGTGGGCACG ATCCCGAGTG ATTCGACCTC GTATACAGCC CAGTCGTATT CGACATCTCC GGTCACGACG TATGATGCCG ATGGCAACGC CTCCAAGGTC GCACTGACCT GGACCCAGAG TTCGACCAAC CCTCTGGTCT GGACGGTCAG CGCCTATGAT GCCGGCGGCA CCGGCAAGGT TGCCTCGAAC AGTTTTGAGG TGACGTTTGA CAGCAGCGGT GATCTGGCTT CCGTCACGGG CACCAGCGAT GGCTCCAGTT ATACGTCTTC GACGTCCAGC GGTGCCTCGG TTGATCCGAC CATTACGCTG ACCTCCAATG GCGTTGCGCA GACGATCCGT CTTGATCTCG GCACGATTGG CGGAACCAGC GGCACGACGA TGGCGGCGTC CAGCGGTACG GCAAGCGCGA GTGGGGTGAC CAGCCTGTCG GCGTCAGGGA CAGCGCTCTC GATGGCGACG ACCACGCTGG GTACGACCAC CGGATCGGGG CAGAGCTATA TGACGGCGCC GACGGACGTC AACAGCGTGC CCGTGTCGGC CGTGTGGAGC CAGACATCTG CCAATCCTTC GACGTGGTCG GTTTCGTTGG TCGATCCGTA TGGTGGTTCC GACGTCAGTT CAGATACCTA CAGCGTCGTT TTCAATTCCA ATGGCACGGC GCAAACGGTT ACTGATACGA CGACTGGCGC GACGACCACG CTGTCCAGCC TGAGCGCGAC AGTTAACGGT AAGGCCTACA CCCTGGATGC CAGCGCGGCC TCTTTGTCCA CGACCGCGCT GACCACCAAT ACGGCGCTGA CCAGCGACAG CGTGACCAGC GGCACCTACG AGGGGGCCGA AATCGAGAGC GACGGCTCCG TCATGGCCGA GTTCAGCAAC GGCGACACGC AGTTGATCGG CAAGGTCGCG CTCAGCACGT TTGCCAATGT CGATGGCCTG AATGCGGTCA CCGGCCAGGC TTATACCGCC ACGGCGGCAT CCGGCGCGGC GCAGACGGGC ACCGTCGGGT CGAATGGAAC GGGAACGCTG GAAGTCGGCT ATGTCGAATC CTCGACGACC GACCTGACCA GCGATCTGTC CGCCCTGATC GTGGATCAGG AAGCGTACTC GGCCAATACC AAGGTCGTCA CGACTGCTGA TGACCTGCTC CAGGCCACCA TCTCGATGAA GCAGGGCTGA
|
Protein sequence | MSVFNALSTA VSGIDAQSTA FTNLSNNIAN SQTVGYKAES TSFQDFVAGS LTSSTASSDI SDSVAAVDVQ NVGAQGTASA STDTLAMAIN GNGLFDVSEE TGQATSGTTQ FENTQYYTRN GEFYENNEGY LVNTTGYYLD GYMADSNGSL GNTLTQINVA NVSFRPTETT TITQSAAVGT IPSDSTSYTA QSYSTSPVTT YDADGNASKV ALTWTQSSTN PLVWTVSAYD AGGTGKVASN SFEVTFDSSG DLASVTGTSD GSSYTSSTSS GASVDPTITL TSNGVAQTIR LDLGTIGGTS GTTMAASSGT ASASGVTSLS ASGTALSMAT TTLGTTTGSG QSYMTAPTDV NSVPVSAVWS QTSANPSTWS VSLVDPYGGS DVSSDTYSVV FNSNGTAQTV TDTTTGATTT LSSLSATVNG KAYTLDASAA SLSTTALTTN TALTSDSVTS GTYEGAEIES DGSVMAEFSN GDTQLIGKVA LSTFANVDGL NAVTGQAYTA TAASGAAQTG TVGSNGTGTL EVGYVESSTT DLTSDLSALI VDQEAYSANT KVVTTADDLL QATISMKQG
|
| |