Gene Ava_C0048 details


Gene Information

Locus tag: Ava_C0048
Symbol:
ID: 3677839
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anabaena variabilis ATCC 29413
Kingdom: Bacteria
Replicon accession: NC_007412
Strand:
Start bp: 71562
End bp: 72980
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 44%
IMG OID: 637715132
Product: hypothetical protein
Protein accession: YP_320326
Protein GI: 75812709
COG category:
COG ID:
TIGRFAM ID:
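
The coordinate and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that check, assuming the Start and End positions are 1-based and inclusive and that the gene length includes the stop codon. Both conventions are assumptions about this record, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(71562, 72980)
print(length, protein_length_aa(length))  # prints: 1419 472
```

Under these assumptions the reported Gene Length (1419 bp) and Protein Length (472 aa) agree with the Start and End coordinates.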


Plasmid Coverage information

Num covering plasmid clones: 35
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.217357
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAACTAG GTTTTTATCC AGTACTAGGT AAGAGTGATT TTCTCAAAAT CAAGGGTGAA 
AAAATCCCGA TTTGGCACTT GCTTGAATAT CAGCCTGCGG GCTGGCTATA CTCACTCGCT
ACAAAAGCAG AAATTGTTCC AGATAGTCTG ATTATTCATG ACTGTGGATC GTTTAACTAC
CGTGACCAAG ATATTCCTAC TCTCAACGGT AAATATGTTG ATGCCCACTG GTCTATTCAT
AGATATCGAG AACGTTCAAA AGTCGGTGAT ATTATTGTTT GTCCTGACCA TTTACTTGTA
GGAGAAAATA TTAGAGAACG GCACGAATAT AACCTTCAAC AAGCAAAAAC ATTTATCCAA
CTTGCTAAAA GTTATGTTCC TAATAGAATC CCCCTAGCAG TCATCCACGG GCAGTCTCTC
AGTGAACGCC TTGAAGTAGC AAAATATCTT CTGGGACTAG GATACCGCCA TCTTGGCATC
GGTGGCCTTG TTCCTCAGGC GAGGGAGTAT TCAACTAACT TGCACATTAT CAAAACAATC
ACTGAAGTTG TGCGTTCACG AAGTGACTCG GAACGAGTAT CGCCGTTCGC GAAGCGTGTC
CGACAGGACA AGGCGGGCGT TTCGCCATCG CGCAGCGACT CCGCAGGAGT ATCGCACGAG
CCAAATGTTC ATCTACACGT TTTTGGACTA TGTTCGCCCC AGTATGCCAA AGCTTTTACT
CAGATGGGAT TATCCTTTGA TGGTTCAACT TTTATCCGAG AAGGACTAGG CGGTGGGATG
TTCGTCAGCA ATGAGGAAAA ATTAATTCGG ATGCCAGCCT ACTGCACGCC AAAGTGCAAC
TGTAATGTTT GCAGGGTTCT TAACCGACAT AGGATTGACC CTCGGTTAAC AAACAAAGGA
CGGACGCATA CTATGGGAAG AATCGCCCAC AACCTCAATC TGGCGATCGG CACTTACAGA
AAATTCATTC CCAAGGAAAA AATCTACCTG GTTGCTGGAT GTGGCAAGCA GCTTCCCTAC
TCTGCTGCCG CCAAAGACTT ATATTGTTCG CAGCACTTTC AAGCTTGCCG TCGCTACGTA
GAAGAAAAAG AATCAAGATG GTATATCCTG TCACCTTTAC ATCAAGTTCT TAACCCAGAA
ACAATCATCA AGCCCTACGA TAAATCCCCC TATTCCTTAT CTCATCAAGA ACGTATACTA
TGGGCGGAAC AAGTAGCAGA AAACCTTATA CAAGTTGCGT CTTTGGAAAT AGAGTTTGTA
TTCTTGACGG GCAAGCTTTA CAGACAGGAG GTAACACCCA TTTTACAAGC GAAGGGATAC
GAGACAAAAG TACCCATGCA GCACCTAGCC ATTGGACAAC AACTTGCTTG GATAAAGAAG
GAACTGGAGC AAGAAAAACA GCTTGTTTTA AATATTTAA
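
The GC content field in the Gene Information section can in principle be recomputed from the gene sequence above. The sketch below shows one way to do that in Python. The function name `gc_percent` is hypothetical, and how the database rounds its reported percentage is not stated in this record, so the result may differ slightly from the listed value.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C among all non-whitespace characters of a sequence.

    Whitespace (the 10-base blocks and line breaks used above) is ignored.
    """
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(bases.count(b) for b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used only as a short demo input.
first_line = "ATGAAACTAG GTTTTTATCC AGTACTAGGT AAGAGTGATT TTCTCAAAAT CAAGGGTGAA"
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of `first_line` gives the value to compare with the GC content field.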
 
Protein sequence
MKLGFYPVLG KSDFLKIKGE KIPIWHLLEY QPAGWLYSLA TKAEIVPDSL IIHDCGSFNY 
RDQDIPTLNG KYVDAHWSIH RYRERSKVGD IIVCPDHLLV GENIRERHEY NLQQAKTFIQ
LAKSYVPNRI PLAVIHGQSL SERLEVAKYL LGLGYRHLGI GGLVPQAREY STNLHIIKTI
TEVVRSRSDS ERVSPFAKRV RQDKAGVSPS RSDSAGVSHE PNVHLHVFGL CSPQYAKAFT
QMGLSFDGST FIREGLGGGM FVSNEEKLIR MPAYCTPKCN CNVCRVLNRH RIDPRLTNKG
RTHTMGRIAH NLNLAIGTYR KFIPKEKIYL VAGCGKQLPY SAAAKDLYCS QHFQACRRYV
EEKESRWYIL SPLHQVLNPE TIIKPYDKSP YSLSHQERIL WAEQVAENLI QVASLEIEFV
FLTGKLYRQE VTPILQAKGY ETKVPMQHLA IGQQLAWIKK ELEQEKQLVL NI
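
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, not the method the database itself uses. The names `CODON_TABLE` and `translate` are hypothetical, the sketch stops at the first stop codon, and it does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that case does not arise here).

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order. This string is shared by
# NCBI translation tables 1 and 11, which differ only in permitted start codons.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon.

    Whitespace is ignored; a codon with letters other than A/C/G/T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two 10-base blocks of the gene sequence above.
print(translate("ATGAAACTAG GTTTTTATCC"))  # prints: MKLGFY
```

The output matches the first six residues of the protein sequence listed above. Translating the full gene sequence the same way gives a string that can be compared with the full protein sequence once its spaces are removed.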