Gene Francci3_3777 details

Gene Information

Locus tag: Francci3_3777
Symbol:
ID: 3906061
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4525813
End bp: 4527243
Gene Length: 1431 bp
Protein Length: 476 aa
Translation table: 11
GC content: 71%
IMG OID: 637881103
Product: gamma-aminobutyraldehyde dehydrogenase
Protein accession: YP_482857
Protein GI: 86742457
COG category: [C] Energy production and conversion
COG ID: [COG1012] NAD-dependent aldehyde dehydrogenases
TIGRFAM ID: [TIGR03374] 1-pyrroline dehydrogenase
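
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(4525813, 4527243)
print(length, protein_length_aa(length))  # prints: 1431 476
```

Under these assumptions the result agrees with the reported gene length of 1431 bp and protein length of 476 aa.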


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00255775
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
ATGGATGCGA TCTTCGCCAA TCTCATCGGT GGCGTCTCCA CCCCGGCCGT CGATGGCGAG 
ACAATGTCCA TCCTGAACCC GTCGACGGCG GACCTGCTGG CGCAGGCACC CCGGTCGGGG
CCGGCCGACG TCGACGCGGC CGTGGCGGCC GCGAGCGCGG CGTTCGAGGG CTGGCGGGAC
ACCACGCCCG CGCAGCGGTC CCGGGCACTG CTGCGGGTGG CCGACGCGCT GGAGGACCGG
GCCGACGAGG TCGCGGCGGT GGAGAGCGCG AACACCGGCA AACCGCTGCG GCTCACCGTG
GACGAGGAGG TCGGGCCGTC CGCCGACCAG ATCCGGTTCT TCGCCGGGGC CGCGCGGCTG
CTCGAGGGCC GGTCGGCCGG CGAGTACCTG GCGGGGTACA CCAGTTATGT CCGCCGCGAG
CCCATCGGGG TCTGTGCCCA GGTCACGCCC TGGAACTACC CGCTGATGAT GGCGGTCTGG
AAGGTCGCGC CGGCGATCGC GGCGGGCAAC ACCGTCGTGC TCAAGCCCTC CGACACCACC
CCGGCGTCCA GCCTGCTGCT TGCCGAGATC GCCGCGGAGT TCCTCCCGCC CGGCGTGCTC
AACGTGGTGT GTGGTGATCG TGACACCGGA CGCGCGCTGG TGGCCCATCC GGGCCCGGCG
ATGGTCTCGG TGACCGGCAG CGTGCGGGCC GGGATGGAGA TCGCCCGGGC CGCGGCGGAC
TCCTTGAAGA GGGTGCACCT CGAACTCGGG GGCAAGGCGC CGGTCATCGT CTTCGACGAC
GTGGATCCGG CGGTGGTGGC GCGGCAGATC GCCGAGGCCG CGTACTTCAA CGCCGGCCAG
GACTGCACGG CGGCGACCAG GGTGCTGGCC GCGCCGGGCG TCCACGACGA GCTCGCAGCC
GCCCTGGCCG AGGCGGCCGG GAAGACCGCG ACTGGATCGC CGGAGGAGCC CGACGTCGAC
TACGGGCCCC TCAACAACGC GGGCCAACTG GCCCGGGTGA GCGGTTTCGT GGAGCGGGCA
CCTGAACATG CCGAGGTGCT CGCCGGCGGT GCTCCGCTGG ACCGGGCGGG CTACTTCTAC
CCGGCCACGG TCGTCTCGGG TCTGCGCCAG GACGACGAGC TCATCCAGTC CGAGGTGTTC
GGCCCGATCA TCACGGTGCA GCGGTTTGAC TCGGAGGACA CCGCGGTTGC CTGGGCCAAC
GGGGTCGAAT ACGGCCTGGC CTCCAGCGTC TGGACCCGCG ATCACGGCCG GGCCCTGCGG
GTCGCCCGCC GGCTGGACTT CGGCTGCGTA TGGATCAACA CGCACATCCG GCTCGTGGCG
GAGATGCCGC ACGGTGGGTT CAAGAAGAGC GGGTACGGCA AGGATCTGTC GGTCTACGGC
CTGGAGGACT ACACCAGGAT CAAGCATGTG ATGAGCAACA TCGAATTCTG A
 
Protein sequence
MDAIFANLIG GVSTPAVDGE TMSILNPSTA DLLAQAPRSG PADVDAAVAA ASAAFEGWRD 
TTPAQRSRAL LRVADALEDR ADEVAAVESA NTGKPLRLTV DEEVGPSADQ IRFFAGAARL
LEGRSAGEYL AGYTSYVRRE PIGVCAQVTP WNYPLMMAVW KVAPAIAAGN TVVLKPSDTT
PASSLLLAEI AAEFLPPGVL NVVCGDRDTG RALVAHPGPA MVSVTGSVRA GMEIARAAAD
SLKRVHLELG GKAPVIVFDD VDPAVVARQI AEAAYFNAGQ DCTAATRVLA APGVHDELAA
ALAEAAGKTA TGSPEEPDVD YGPLNNAGQL ARVSGFVERA PEHAEVLAGG APLDRAGYFY
PATVVSGLRQ DDELIQSEVF GPIITVQRFD SEDTAVAWAN GVEYGLASSV WTRDHGRALR
VARRLDFGCV WINTHIRLVA EMPHGGFKKS GYGKDLSVYG LEDYTRIKHV MSNIEF
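
The gene and protein sequences above are laid out in space-separated blocks of ten characters. The following is a minimal sketch, not the database's own tooling, of how such text could be cleaned, checked for GC content, and translated. It assumes translation table 11 assigns the same amino acid to each codon as the standard genetic code (the two differ in which codons may act as start codons), and it stops at the first stop codon. The helper names are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code assignments).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
print(translate("ATGGATGCGA TCTTCGCCAA TCTCATCGGT"))  # prints: MDAIFANLIG
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the protein sequence and the GC content field listed in this record.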